COMPOSITIONS, SPLICE VARIANTS AND METHODS
RELATING TO COLON SPECIFIC GENES AND PROTEINS

INTRODUCTION

This application claims the benefit of priority from U.S. Provisional Patent 
Application Serial No. 60/431,143 filed December 4, 2002 and 60/431,206 filed 
December 4, 2002, which are herein incorporated by reference in their entireties. 

FIELD OF THE INVENTION 

The present invention relates to newly identified nucleic acids and polypeptides
present in normal and neoplastic colon cells, including fragments, variants and derivatives
of the nucleic acids and polypeptides. The present invention also relates to antibodies to
the polypeptides of the invention, as well as agonists and antagonists of the polypeptides
of the invention. The invention also relates to compositions comprising the nucleic acids,
polypeptides, antibodies, post-translational modifications (PTMs), variants, derivatives,
agonists and antagonists thereof and methods for the use of these compositions. These
uses include identifying, diagnosing, monitoring, staging, imaging and treating colon
cancer and/or non-cancerous disease states in colon, identifying and monitoring colon
tissue, and identifying and/or designing agonists and antagonists of polypeptides of
the invention. The uses also include gene therapy, therapeutic molecules including but not
limited to antibodies or antisense molecules, production of transgenic animals and cells,
and production of engineered colon tissue for treatment and research.

BACKGROUND OF THE INVENTION 
Colorectal cancer is the second most common cause of cancer death in the United
States and the third most prevalent cancer in both men and women. M. L. Davila & A. D.
Davila, Screening for Colon and Rectal Cancer, in Colon and Rectal Cancer 47 (Peter S.
Edelstein ed., 2000). The American Cancer Society estimates that there will be about
105,500 new cases of colon cancer and 42,000 new cases of rectal cancer in 2003 in the
United States. Colon cancer and rectal cancer will cause about 57,100 deaths combined.
ACS Website: cancer.org on the world wide web. Nearly all cases of colorectal cancer
arise from adenomatous polyps, some of which mature into large polyps, undergo
abnormal growth and development, and ultimately progress into cancer. Davila at 55-56.
This progression would appear to take at least 10 years in most patients, rendering it a
readily treatable form of cancer if diagnosed early, when the cancer is localized. Davila at
56; Walter J. Burdette, Cancer: Etiology, Diagnosis, and Treatment 125 (1998).

Although our understanding of the etiology of colon cancer is undergoing
continual refinement, extensive research in this area points to a combination of factors,
including age, hereditary and nonhereditary conditions, and environmental/dietary factors.
Age is a key risk factor in the development of colorectal cancer, Davila at 48, with men
and women over 40 years of age becoming increasingly susceptible to that cancer,
Burdette at 126. Incidence rates increase considerably in each subsequent decade of life.
Davila at 48. A number of hereditary and nonhereditary conditions have also been linked
to a heightened risk of developing colorectal cancer, including familial adenomatous
polyposis (FAP), hereditary nonpolyposis colorectal cancer (Lynch syndrome or HNPCC),
a personal and/or family history of colorectal cancer or adenomatous polyps, inflammatory
bowel disease, diabetes mellitus, and obesity. Davila at 47; Henry T. Lynch & Jane F.
Lynch, Hereditary Nonpolyposis Colorectal Cancer (Lynch Syndromes), in Colon and
Rectal Cancer 67-68 (Peter S. Edelstein ed., 2000).

Environmental/dietary factors associated with an increased risk of colorectal
cancer include a high fat diet, high dietary intake of red meat, and a sedentary lifestyle.
Davila at 47; Reddy, B. S., Prev. Med. 16(4): 460-7 (1987). Conversely,
environmental/dietary factors associated with a reduced risk of colorectal cancer include a
diet high in fiber, folic acid, and calcium, and hormone-replacement therapy in
post-menopausal women. Davila at 50-55. The effect of antioxidants in reducing the risk of
colon cancer is unclear. Davila at 53.

Because colon cancer is highly treatable when detected at an early, localized stage,
screening should be a part of routine care for all adults starting at age 50, especially those
with first-degree relatives with colorectal cancer. One major advantage of colorectal
cancer screening over its counterparts in other types of cancer is its ability to not only
detect precancerous lesions, but to remove them as well. Davila at 56. The key colorectal
cancer screening tests in use today are fecal occult blood test, sigmoidoscopy,
colonoscopy, double-contrast barium enema, and the carcinoembryonic antigen (CEA)
test. Burdette at 125; Davila at 56.

The fecal occult blood test (FOBT) screens for colorectal cancer by detecting the
amount of blood in the stool, the premise being that neoplastic tissue, particularly
malignant tissue, bleeds more than typical mucosa, with the amount of bleeding increasing
with polyp size and cancer stage. Davila at 56-57. While effective at detecting early stage
tumors, FOBT is unable to detect adenomatous polyps (premalignant lesions), and,
depending on the contents of the fecal sample, is subject to rendering false positives.
Davila at 56-59. Sigmoidoscopy and colonoscopy, by contrast, allow direct visualization
of the bowel, and enable one to detect, biopsy, and remove adenomatous polyps. Davila at
59-60, 61. Despite the advantages of these procedures, there are accompanying
downsides: sigmoidoscopy, by definition, is limited to the sigmoid colon and below,
colonoscopy is a relatively expensive procedure, and both share the risk of possible bowel
perforation and hemorrhaging. Davila at 59-60. Double-contrast barium enema (DCBE)
enables detection of lesions better than FOBT, and almost as well as colonoscopy, but it
may be limited in evaluating the winding rectosigmoid region. Davila at 60. The CEA
blood test, which involves screening the blood for carcinoembryonic antigen, shares the
downside of FOBT, in that it is of limited utility in detecting colorectal cancer at an early
stage. Burdette at 125.

Once colon cancer has been diagnosed, treatment decisions are typically made in
reference to the stage of cancer progression. A number of techniques are employed to
stage the cancer (some of which are also used to screen for colon cancer), including
pathologic examination of resected colon, sigmoidoscopy, colonoscopy, and various
imaging techniques. AJCC Cancer Staging Handbook 84 (Irvin D. Fleming et al. eds., 5th
ed. 1998); Montgomery, R. C. and Ridge, J.A., Semin. Surg. Oncol. 15(3): 143-150
(1998). Moreover, chest films, liver functionality tests, and liver scans are employed to
determine the extent of metastasis. Fleming at 84. While computerized tomography and
magnetic resonance imaging are useful in staging colorectal cancer in its later stages, both
have unacceptably low staging accuracy for identifying early stages of the disease, due to
the difficulty that both methods have in (1) revealing the depth of bowel wall tumor
infiltration and (2) diagnosing malignant adenopathy. Thoeni, R. F., Radiol. Clin. N. Am.
35(2): 457-85 (1997). Rather, techniques such as transrectal ultrasound (TRUS) are
preferred in this context, although this technique is inaccurate with respect to detecting
small lymph nodes that may contain metastases. David Blumberg & Frank G. Opelka,
Neoadjuvant and Adjuvant Therapy for Adenocarcinoma of the Rectum, in Colon and
Rectal Cancer 316 (Peter S. Edelstein ed., 2000).

Several classification systems have been devised to stage the extent of colorectal
cancer, including the Dukes' system and the more detailed International Union against
Cancer-American Joint Committee on Cancer TNM staging system, which is considered
by many in the field to be a more useful staging system. Burdette at 126-27. The TNM
system, which is used for either clinical or pathological staging, is divided into four stages,
each of which evaluates the extent of cancer growth with respect to primary tumor (T),
regional lymph nodes (N), and distant metastasis (M). Fleming at 84-85. The system
focuses on the extent of tumor invasion into the intestinal wall, invasion of adjacent
structures, the number of regional lymph nodes that have been affected, and whether
distant metastasis has occurred. Fleming at 81.

Stage 0 is characterized by in situ carcinoma (Tis), in which the cancer cells are
located inside the glandular basement membrane (intraepithelial) or lamina propria
(intramucosal). In this stage, the cancer has not spread to the regional lymph nodes (N0),
and there is no distant metastasis (M0). In stage I, there is still no spread of the cancer to
the regional lymph nodes and no distant metastasis, but the tumor has invaded the
submucosa (T1) or has progressed further to invade the muscularis propria (T2). Stage II
also involves no spread of the cancer to the regional lymph nodes and no distant
metastasis, but the tumor has invaded the subserosa, or the nonperitonealized pericolic or
perirectal tissues (T3), or has progressed to invade other organs or structures, and/or has
perforated the visceral peritoneum (T4). Stage III is characterized by any of the T
substages, no distant metastasis, and either metastasis in 1 to 3 regional lymph nodes (N1)
or metastasis in four or more regional lymph nodes (N2). Lastly, stage IV involves any of
the T or N substages, as well as distant metastasis. Fleming at 84-85; Burdette at 127.
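
The stage grouping described in the preceding paragraph can be written as a small lookup.
The following is only an illustrative sketch of that grouping, not a clinical tool: the
function name tnm_stage and the string spellings of the categories ("Tis", "T1", "N0",
"M1", and so on) are assumptions made for this example and do not come from the cited
handbook.

```python
def tnm_stage(t: str, n: str, m: str) -> str:
    """Map a T/N/M triple to the stage group described in the text.

    Hypothetical helper for illustration only; category spellings are assumed.
    """
    t_norm = t.strip().upper()   # "TIS", "T1" ... "T4"
    n_norm = n.strip().upper()   # "N0", "N1", "N2"
    m_norm = m.strip().upper()   # "M0", "M1"

    if t_norm not in {"TIS", "T1", "T2", "T3", "T4"}:
        raise ValueError(f"unknown T category: {t!r}")
    if n_norm not in {"N0", "N1", "N2"}:
        raise ValueError(f"unknown N category: {n!r}")
    if m_norm not in {"M0", "M1"}:
        raise ValueError(f"unknown M category: {m!r}")

    # Stage IV: any T, any N, with distant metastasis.
    if m_norm == "M1":
        return "IV"
    # Stage III: any T, regional lymph node metastasis (N1 or N2), no distant metastasis.
    if n_norm in {"N1", "N2"}:
        return "III"
    # From here on the case is N0, M0, so the stage depends on T alone.
    if t_norm == "TIS":
        return "0"
    if t_norm in {"T1", "T2"}:
        return "I"
    return "II"  # T3 or T4


if __name__ == "__main__":
    print(tnm_stage("T3", "N0", "M0"))
```

Checking M first and N second mirrors the text, in which stage IV accepts any T or N
substage and stage III accepts any T substage.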

Currently, pathological staging of colon cancer is preferable to clinical staging
as pathological staging provides a more accurate prognosis. Pathological staging typically
involves examination of the resected colon section, along with surgical examination of the
abdominal cavity. Fleming at 84. Clinical staging would be a preferred method of staging
were it at least as accurate as pathological staging, as it does not depend on the invasive
procedures of its counterpart.

Turning to the treatment of colorectal cancer, surgical resection results in a cure for
roughly 50% of patients. Irradiation is used both preoperatively and postoperatively in
treating colorectal cancer. Chemotherapeutic agents, particularly 5-fluorouracil, are also
powerful weapons in treating colorectal cancer. Other agents include irinotecan and
floxuridine, cisplatin, levamisole, methotrexate, interferon-α, and leucovorin. Burdette at
125, 132-33. Nonetheless, thirty to forty percent of patients will develop a recurrence of
colon cancer following surgical resection, which in many patients is the ultimate cause of
death. Wayne De Vos, Follow-up After Treatment of Colon Cancer, in Colon and Rectal
Cancer 225 (Peter S. Edelstein ed., 2000). Accordingly, colon cancer patients must be
closely monitored to determine response to therapy and to detect persistent or recurrent
disease and metastasis.

The next few paragraphs describe some of the molecular bases of colon cancer. In
the case of FAP, the tumor suppressor gene APC (adenomatous polyposis coli),
chromosomally located at 5q21, has been either inactivated or deleted by mutation.
Alberts et al., Molecular Biology of the Cell 1288 (3d ed. 1994). The APC protein plays a
role in a number of functions, including cell adhesion, apoptosis, and repression of the
c-myc oncogene. N. R. Hall & R. D. Madoff, Genetics and the Polyp-Cancer Sequence, in
Colon and Rectal Cancer 8 (Peter S. Edelstein, ed., 2000). Of those patients with
colorectal cancer who have normal APC genes, over 65% have such mutations in the
cancer cells but not in other tissues. Alberts et al., supra at 1288. In the case of HNPCC,
patients manifest abnormalities in the tumor suppressor gene HNPCC, but only about 15%
of tumors contain the mutated gene. Id. A host of other genes have also been implicated
in colorectal cancer, including the Ki-ras, N-ras, H-ras and c-myc oncogenes, and the
tumor suppressor genes DCC (deleted in colon carcinoma) and p53. Hall & Madoff, at 8-9;
Alberts et al., at 1288.

Abnormalities in the Wg/Wnt signal transduction pathway are also associated with the
development of colorectal carcinoma. Taipale, J. and Beachy, P. A. Nature 411: 349-354
(2001). Wnt1 is a secreted protein gene originally identified within mouse mammary
cancers by its insertion into the mouse mammary tumor virus (MMTV) gene. The protein
is homologous to the wingless (Wg) gene product of Drosophila, in which it functions as
an important factor for the determination of dorsal-ventral segmentation and regulates the
formation of fly imaginal discs. The Wg/Wnt pathway controls cell proliferation, death and
differentiation. Taipale (2001). There are at least 13 members in the Wnt family. These
proteins have been found expressed mainly in the central nervous system (CNS) of
vertebrates as well as other tissues such as mammary and intestine. The Wnt proteins are
the ligands for a family of seven transmembrane domain receptors related to the Frizzled
gene product in Drosophila. Binding of Wnt to Frizzled stimulates the activity of the
downstream target, Dishevelled, which in turn inactivates glycogen synthase kinase
3β (GSK3β). Taipale (2001). Usually, active GSK3β will form a complex with the
adenomatous polyposis coli (APC) protein and phosphorylate another complex member,
β-catenin. Once phosphorylated, β-catenin is directed to degradation through the ubiquitin
pathway. When GSK3β or APC activity is down-regulated, β-catenin accumulates in
the cytoplasm and binds to the T-cell factor or lymphoid enhancer factor (Tcf/Lef)
family of transcription factors. Binding of β-catenin to Tcf releases the transcriptional
repression and induces gene transcription. Among the genes regulated by β-catenin are a
transcriptional repressor Engrailed, a transforming growth factor-β (TGF-β) family
member Decapentaplegic, and the cytokine Hedgehog in Drosophila. β-Catenin is also
involved in regulating cell adhesion by binding to α-catenin and E-cadherin. On the other
hand, binding of β-catenin to these proteins controls the cytoplasmic β-catenin level and
its complexing with TCF. Taipale (2001). Growth factor stimulation and activation of
c-src or v-src also regulate the β-catenin level by phosphorylation of α-catenin and its
related protein, p120cas. When phosphorylated, these proteins decrease their binding to
E-cadherin and β-catenin, resulting in the accumulation of cytoplasmic β-catenin. Reynolds,
A.B. et al. Mol. Cell. Biol. 14: 8333-8342 (1994). In colon cancer, c-src enzymatic activity
has been shown to be increased to the level of v-src. Alteration of components in the
Wg/Wnt pathway promotes colorectal carcinoma development. The best known
modifications are to the APC gene. Nicola S et al. Hum. Mol. Genet. 10:721-733 (2001).
This germline mutation causes the appearance of hundreds to thousands of adenomatous
polyps in the large bowel. It is the gene defect that accounts for the autosomal
dominantly inherited FAP and related syndromes. The molecular alterations that occur
in this pathway largely involve deletions of alleles of tumor-suppressor genes, such as
APC, p53 and Deleted in Colorectal Cancer (DCC), combined with mutational activation
of proto-oncogenes, especially c-Ki-ras. Aoki, T. et al. Human Mutat. 3: 342-346 (1994).
All of these lead to genomic instability in colorectal cancers.

Another source of genomic instability in colorectal cancer is the defect of DNA
mismatch repair (MMR) genes. Human homologues of the bacterial mutHLS complex
(hMSH2, hMLH1, hPMS1, hPMS2 and hMSH6), which is involved in DNA mismatch
repair in bacteria, have been shown to cause HNPCC (about 70-90% of HNPCC) when
mutated. Modrich, P. and Lahue, R. Ann. Rev. Biochem. 65: 101-133 (1996); and
Peltomäki, P. Hum. Mol. Genet. 10: 735-740 (2001). The inactivation of these proteins
leads to the accumulation of mutations and causes genetic instability that represents errors
in the accurate replication of the repetitive mono-, di-, tri- and tetra-nucleotide repeats,
which are scattered throughout the genome (microsatellite regions). Jass, J.R. et al. J.
Gastroenterol. Hepatol. 17: 17-26 (2002). As in classic FAP, mutational activation of
c-Ki-ras is also required for the promotion of microsatellite instability (MSI) in the
alternative HNPCC. Mutations in other proteins such as the tumor suppressor protein
phosphatase PTEN (Zhou, X.P. et al. Hum. Mol. Genet. 11: 445-450 (2002)), BAX
(Butler, L.M. Aust. N. Z. J. Surg. 69: 88-94 (1999)), Caspase-5 (Planck, M. Cancer Genet.
Cytogenet. 134: 46-54 (2002)), TGFβ-RII (Fallik, D. et al. Gastroenterol. Clin. Biol. 24:
917-22 (2000)) and IGFII-R (Giovannucci E. J. Nutr. 131: 3109S-20S (2001)) have also
been found in some colorectal tumors, possibly as the cause of MMR defect.

Some tyrosine kinases have been shown to be up-regulated in colorectal tumor tissues or
cell lines like HT29. Skoudy, A. et al. Biochem. J. 317 (Pt 1): 279-84 (1996). Focal
adhesion kinase (FAK) and its upstream kinases c-src and c-yes in colonic epithelial cells
may play an important role in the promotion of colorectal cancers through the extracellular
matrix (ECM) and integrin-mediated signaling pathways. Jessup, J.M. et al., The
molecular biology of colorectal carcinoma, in: The Molecular Basis of Human Cancer.
251-268 (Coleman W.B. and Tsongalis G.J. Eds. 2002). The formation of c-src/FAK
complexes may coordinately deregulate VEGF expression and apoptosis inhibition.
Recent evidence suggests that a specific signal-transduction pathway for cell survival that
implicates integrin engagement leads to FAK activation and thus activates PI-3 kinase and
akt. In turn, akt phosphorylates BAD and blocks apoptosis in epithelial cells. The
activation of c-src in colon cancer may induce VEGF expression through the hypoxia
pathway. Other genes that may be implicated in colorectal cancer include Cox enzymes
(Ota, S. et al. Aliment. Pharmacol. Ther. 16 (Suppl 2): 102-106 (2002)), estrogen
(al-Azzawi, F. and Wahab, M. Climacteric 5: 3-14 (2002)), peroxisome proliferator-activated
receptor-γ (PPAR-γ) (Gelman, L. et al. Cell. Mol. Life Sci. 55: 932-943 (1999)), IGF-I
(Giovannucci (2001)), thymine DNA glycosylase (TDG) (Hardeland, U. et al. Prog.
Nucleic Acid Res. Mol. Biol. 68: 235-253 (2001)) and EGF (Mendelsohn, J. Endocrine-
Related Cancer 8: 3-9 (2001)).

Gene deletion and mutation are not the only causes for development of colorectal
cancers. Epigenetic silencing by DNA methylation also accounts for the loss of function
of colorectal cancer suppressor genes. A strong association between MSI and CpG island
methylation has been well characterized in sporadic colorectal cancers with high MSI but
not in those of hereditary origin. In one experiment, DNA methylation of MLH1,
CDKN2A, MGMT, THBS1, RARB, APC, and p14ARF genes has been shown in 80%,
55%, 23%, 23%, 58%, 35%, and 50% of 40 sporadic colorectal cancers with high MSI,
respectively. Yamamoto, H. et al. Genes Chromosomes Cancer 33: 322-325 (2002); and
Kim, K.M. et al. Oncogene 21(35): 5441-9 (2002). Carcinogen metabolism enzymes
such as GST, NAT, CYP and MTHFR are also associated with an increased or decreased
colorectal cancer risk. Pistorius, S. et al. Kongressbd. Dtsch. Ges. Chir. Kongr. 118: 820-824
(2001); and Potter, J.D. J. Natl. Cancer Inst. 91: 916-932 (1999).

From the foregoing, it is clear that procedures used for detecting, diagnosing,
monitoring, staging, prognosticating, and preventing the recurrence of colorectal cancer
are of critical importance to the outcome of the patient. Moreover, current procedures,
while helpful in each of these analyses, are limited by their specificity, sensitivity,
invasiveness, and/or their cost. As such, highly specific and sensitive procedures that
would operate by way of detecting novel markers in cells, tissues, or bodily fluids, with
minimal invasiveness and at a reasonable cost, would be highly desirable.

Accordingly, there is a great need for more sensitive and accurate methods for
predicting whether a person is likely to develop colorectal cancer, for diagnosing
colorectal cancer, for monitoring the progression of the disease, for staging the colorectal
cancer, for determining whether the colorectal cancer has metastasized, and for imaging
the colorectal cancer. Following accurate diagnosis, there is also a need for less invasive
and more effective treatment of colorectal cancer.

Growth and metastasis of solid tumors are also dependent on angiogenesis.
Folkman, J., 1986, Cancer Research, 46, 467-473; Folkman, J., 1990, Journal of the
National Cancer Institute, 82, 4-6. It has been shown, for example, that tumors which
enlarge to greater than 2 mm must obtain their own blood supply and do so by inducing
the growth of new capillary blood vessels. Once these new blood vessels become
embedded in the tumor, they provide a means for tumor cells to enter the circulation and
metastasize to distant sites such as liver, lung or bone. Weidner, N., et al., 1991, The New
England Journal of Medicine, 324(1), 1-8.

Angiogenesis, defined as the growth or sprouting of new blood vessels from
existing vessels, is a complex process that primarily occurs during embryonic
development. The process is distinct from vasculogenesis, in that the new endothelial cells
lining the vessel arise from proliferation of existing cells, rather than differentiating from
stem cells. The process is invasive and dependent upon proteolysis of the extracellular
matrix (ECM), migration of new endothelial cells, and synthesis of new matrix
components. Angiogenesis occurs during embryonic development of the circulatory
system; however, in adult humans, angiogenesis only occurs as a response to a
pathological condition (except during the reproductive cycle in women).

Under normal physiological conditions in adults, angiogenesis takes place only in
very restricted situations such as hair growth and wound healing. Auerbach, W. and
Auerbach, R., 1994, Pharmacol. Ther. 63(3):265-311; Ribatti et al., 1991, Haematologica
76(4):311-20; Risau, 1997, Nature 386(6626):671-4. Angiogenesis progresses by a
stimulus which results in the formation of a migrating column of endothelial cells.
Proteolytic activity is focused at the advancing tip of this "vascular sprout", which breaks
down the ECM sufficiently to permit the column of cells to infiltrate and migrate. Behind
the advancing front, the endothelial cells differentiate and begin to adhere to each other,
thus forming a new basement membrane. The cells then cease proliferation and finally
define a lumen for the new arteriole or capillary.

Unregulated angiogenesis has gradually been recognized to be responsible for a
wide range of disorders, including, but not limited to, cancer, cardiovascular disease,
rheumatoid arthritis, psoriasis and diabetic retinopathy. Folkman, 1995, Nat Med 1(1):27-31;
Isner, 1999, Circulation 99(13):1653-5; Koch, 1998, Arthritis Rheum 41(6):951-62;
Walsh, 1999, Rheumatology (Oxford) 38(2):103-12; Ware and Simons, 1997, Nat Med
3(2):158-64.

Of particular interest is the observation that angiogenesis is required by solid
tumors for their growth and metastases. Folkman, 1986 supra; Folkman 1990, J Natl.
Cancer Inst., 82(1) 4-6; Folkman, 1992, Semin Cancer Biol 3(2):65-71; Zetter, 1998, Annu
Rev Med 49:407-24. A tumor usually begins as a single aberrant cell which can proliferate
only to a size of a few cubic millimeters due to the distance from available capillary beds,
and it can stay 'dormant' without further growth and dissemination for a long period of
time. Some tumor cells then switch to the angiogenic phenotype to activate endothelial
cells, which proliferate and mature into new capillary blood vessels. These newly formed
blood vessels not only allow for continued growth of the primary tumor, but also for the
dissemination and recolonization of metastatic tumor cells. The precise mechanisms that
control the angiogenic switch are not well understood, but it is believed that
neovascularization of tumor mass results from the net balance of a multitude of
angiogenesis stimulators and inhibitors. Folkman, 1995, supra.

One of the most potent angiogenesis inhibitors is endostatin, identified by O'Reilly
and Folkman. O'Reilly et al., 1997, Cell 88(2):277-85; O'Reilly et al., 1994, Cell
79(2):315-28. Its discovery was based on the phenomenon that certain primary tumors can
inhibit the growth of distant metastases. O'Reilly and Folkman hypothesized that a primary
tumor initiates angiogenesis by generating angiogenic stimulators in excess of inhibitors.
However, angiogenic inhibitors, by virtue of their longer half life in the circulation, reach
the site of a secondary tumor in excess of the stimulators. The net result is the growth of
primary tumor and inhibition of secondary tumor. Endostatin is one of a growing list of
such angiogenesis inhibitors produced by primary tumors. It is a proteolytic fragment of a
larger protein: endostatin is a 20 kDa fragment of collagen XVIII (amino acids H1132-
K1315 in murine collagen XVIII). Endostatin has been shown to specifically inhibit
endothelial cell proliferation in vitro and block angiogenesis in vivo. More importantly,
administration of endostatin to tumor-bearing mice leads to significant tumor regression,
and no toxicity or drug resistance has been observed even after multiple treatment cycles.
Boehm et al., 1997, Nature 390(6658):404-407. The fact that endostatin targets genetically
stable endothelial cells and inhibits a variety of solid tumors makes it a very attractive
candidate for anticancer therapy. Fidler and Ellis, 1994, Cell 79(2):185-8; Gastl et al.,
1997, Oncology 54(3):177-84; Hinsbergh et al., 1999, Ann Oncol 10 Suppl 4:60-3. In
addition, angiogenesis inhibitors have been shown to be more effective when combined
with radiation and chemotherapeutic agents. Klement, 2000, J. Clin Invest, 105(8) R15-24;
Browder, 2000, Cancer Res. 60(7) 1878-86; Arap et al., 1998, Science 279(5349):377-80;
Mauceri et al., 1998, Nature 394(6690):287-91.

SUMMARY OF THE INVENTION

The present invention solves many needs in the art by providing nucleic acid
molecules, polypeptides and antibodies thereto, variants and derivatives of the nucleic
acids and polypeptides, and agonists and antagonists thereto that may be used to identify,
diagnose, monitor, stage, image and treat colon cancer and/or non-cancerous disease states
in colon; identify and monitor colon tissue; and identify and design agonists and
antagonists of polypeptides of the invention. The invention also provides gene therapy,
methods for producing transgenic animals and cells, and methods for producing
engineered colon tissue for treatment and research.

One aspect of the present invention relates to nucleic acid molecules that are
specific to colon cells, colon tissue and/or the colon organ. These colon specific nucleic
acids (CSNAs) may be a naturally occurring cDNA, genomic DNA, RNA, or a fragment
of one of these nucleic acids, or may be a non-naturally occurring nucleic acid molecule.
If the CSNA is genomic DNA, then the CSNA is a colon specific gene (CSG). If the
CSNA is RNA, then it is a colon specific transcript encoded by a CSG. Due to alternative
splicing and transcriptional modification, one CSG may encode multiple colon specific
RNAs. In a preferred embodiment, the nucleic acid molecule encodes a polypeptide that
is specific to colon. More preferred is a nucleic acid molecule that encodes a polypeptide
comprising an amino acid sequence of SEQ ID NO: 113-259. In another preferred
embodiment, the nucleic acid molecule comprises a nucleic acid sequence of SEQ ID NO:
1-112. For the CSNA sequences listed herein, DEX0449_001.nt.1 corresponds to SEQ ID
NO: 1. For sequences with multiple splice variants, the parent sequence
DEX0449_001.nt.1 will be followed by DEX0449_001.nt.2, etc. for each splice variant.
The sequences of the corresponding peptides are listed as DEX0449_001.aa.1, etc. For
the mapping of all of the nucleotides and peptides, see the table in the Example 1 section
below.
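
The sequence labels described in the preceding paragraph follow a regular pattern that can
be split mechanically into a gene identifier, a sequence type (nt or aa), and a variant
number. The sketch below is illustrative only: the label grammar is inferred from the
examples given in the text (such as DEX0449_001.nt.1), and the names SeqLabel, parse_label
and is_splice_variant are hypothetical helpers, not part of the disclosure. It does not
reproduce the mapping to SEQ ID NOs, which is given in the table in Example 1.

```python
import re
from typing import NamedTuple


class SeqLabel(NamedTuple):
    gene: str     # e.g. "DEX0449_001"
    kind: str     # "nt" for nucleotide, "aa" for amino acid
    variant: int  # 1 is the parent sequence; higher numbers follow it


# Assumed grammar, inferred from the examples in the text: DEX<digits>_<digits>.<nt|aa>.<n>
_LABEL_RE = re.compile(r"^(DEX\d+_\d+)\.(nt|aa)\.(\d+)$")


def parse_label(label: str) -> SeqLabel:
    """Split a label such as 'DEX0449_001.nt.2' into its three parts."""
    match = _LABEL_RE.match(label.strip())
    if match is None:
        raise ValueError(f"unrecognized sequence label: {label!r}")
    return SeqLabel(match.group(1), match.group(2), int(match.group(3)))


def is_splice_variant(label: str) -> bool:
    """True for labels numbered after the parent sequence (.2, .3, ...)."""
    return parse_label(label).variant > 1


if __name__ == "__main__":
    print(parse_label("DEX0449_001.nt.2"))
    print(is_splice_variant("DEX0449_001.nt.1"))
```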

This aspect of the present invention also relates to nucleic acid molecules that
selectively hybridize or exhibit substantial sequence similarity to nucleic acid molecules
encoding a Colon Specific Protein (CSP), or that selectively hybridize or exhibit
substantial sequence similarity to a CSNA. In one embodiment of the present invention
the nucleic acid molecule comprises an allelic variant of a nucleic acid molecule encoding
a CSP, or an allelic variant of a CSNA. In another embodiment, the nucleic acid molecule
comprises a part of a nucleic acid sequence that encodes a CSP or a part of a nucleic acid
sequence of a CSNA.

In addition, this aspect of the present invention relates to a nucleic acid molecule
further comprising one or more expression control sequences controlling the transcription
and/or translation of all or a part of a CSNA or the transcription and/or translation of a
nucleic acid molecule that encodes all or a fragment of a CSP.

Another aspect of the present invention relates to vectors and/or host cells
comprising a nucleic acid molecule of this invention. In a preferred embodiment, the
nucleic acid molecule of the vector and/or host cell encodes all or a fragment of a CSP. In
another preferred embodiment, the nucleic acid molecule of the vector and/or host cell
comprises all or a part of a CSNA. Vectors and host cells of the present invention are
useful in the recombinant production of polypeptides, particularly CSPs of the present
invention.

Another aspect of the present invention relates to polypeptides encoded by a
nucleic acid molecule of this invention. The polypeptide may comprise either a fragment
or a full-length protein. In a preferred embodiment, the polypeptide is a CSP. However,
this aspect of the present invention also relates to mutant proteins (muteins) of CSPs,
fusion proteins of which a portion is a CSP, and proteins and polypeptides encoded by
allelic variants of a CSNA as provided herein.

A further aspect of the present invention is a novel splice variant which encodes an
amino acid sequence that provides a novel region to be targeted for the generation of
reagents that can be used in the detection and/or treatment of cancer. The novel amino
acid sequence may lead to a unique protein structure, protein subcellular localization,
biochemical processing or function. This information can be used to directly or indirectly
facilitate the generation of additional or novel therapeutics or diagnostics. The nucleotide
sequence in this novel splice variant can be used as a nucleic acid probe for the diagnosis
and/or treatment of cancer.

Another aspect of the present invention relates to antibodies and other binders that
specifically bind to a polypeptide of the instant invention. Accordingly, antibodies or
binders of the present invention specifically bind to CSPs, muteins, fusion proteins, and/or 
homologous proteins or polypeptides encoded by allelic variants of a CSNA as provided 
herein. 

Another aspect of the present invention relates to agonists and antagonists of the
nucleic acid molecules and polypeptides of this invention. The agonists and antagonists of
the instant invention may be used to treat colon cancer and non-cancerous disease states in 
colon and to produce engineered colon tissue. 

Another aspect of the present invention relates to methods for using the nucleic
acid molecules to detect or amplify nucleic acid molecules that have similar or identical
nucleic acid sequences compared to the nucleic acid molecules described herein. Such
methods are useful in identifying, diagnosing, monitoring, staging, imaging and treating
colon cancer and/or non-cancerous disease states in colon. Such methods are also useful
in identifying and/or monitoring colon tissue. In addition, measurement of levels of one or
more of the nucleic acid molecules of this invention may be useful as a diagnostic as part 
of a panel in combination with known other markers, particularly those described in the 
colon cancer background section above. 

Another aspect of the present invention relates to use of the nucleic acid molecules
of this invention in gene therapy, for producing transgenic animals and cells, and for
producing engineered colon tissue for treatment and research. 

Another aspect of the present invention relates to methods for detecting 
polypeptides of this invention, preferably using antibodies thereto. Such methods are
useful to identify, diagnose, monitor, stage, image and treat colon cancer and
non-cancerous disease states in colon. In addition, measurement of levels of one or more of
the polypeptides of this invention may be useful to identify, diagnose, monitor, stage,
and/or image colon cancer in combination with known other markers, particularly those
described in the colon cancer background section above. The polypeptides of the present
invention can also be used to identify and/or monitor colon tissue, and to produce
engineered colon tissue. 

Yet another aspect of the present invention relates to a computer readable means of 
storing the nucleic acid and amino acid sequences of the invention. The records of the 
computer readable means can be accessed for reading and displaying of sequences for
comparison, alignment and ordering of the sequences of the invention to other sequences.
In addition, the computer records regarding the nucleic acid and/or amino acid sequences 
and/or measurements of their levels may be used alone or in combination with other 
markers to diagnose colon related diseases. 

DETAILED DESCRIPTION OF THE INVENTION 
Definitions and General Techniques

Unless otherwise defined herein, scientific and technical terms used in connection
with the present invention shall have the meanings that are commonly understood by those
of ordinary skill in the art. Further, unless otherwise required by context, singular terms
shall include pluralities and plural terms shall include the singular. Generally,
nomenclatures used in connection with, and techniques of, cell and tissue culture,
molecular biology, immunology, microbiology, genetics and protein and nucleic acid
chemistry and hybridization described herein are those well known and commonly used in
the art. The methods and techniques of the present invention are generally performed
according to conventional methods well known in the art and as described in various
general and more specific references that are cited and discussed throughout the present
specification unless otherwise indicated. See, e.g., Sambrook et al., Molecular Cloning: A
Laboratory Manual, 2d ed., Cold Spring Harbor Laboratory Press (1989) and Sambrook et
al., Molecular Cloning: A Laboratory Manual, 3d ed., Cold Spring Harbor Press (2001);
Ausubel et al., Current Protocols in Molecular Biology, Greene Publishing Associates
(1992, and Supplements to 2000); Ausubel et al., Short Protocols in Molecular Biology: A
Compendium of Methods from Current Protocols in Molecular Biology, 4th Ed., Wiley &
Sons (1999); Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor
Laboratory Press (1990); and Harlow and Lane, Using Antibodies: A Laboratory Manual,
Cold Spring Harbor Laboratory Press (1999).

Enzymatic reactions and purification techniques are performed according to
manufacturer's specifications, as commonly accomplished in the art or as described
herein. The nomenclatures used in connection with, and the laboratory procedures and
techniques of, analytical chemistry, synthetic organic chemistry, and medicinal and
pharmaceutical chemistry described herein are those well known and commonly used in
the art. Standard techniques are used for chemical syntheses, chemical analyses,
pharmaceutical preparation, formulation, and delivery, and treatment of patients.

The following terms, unless otherwise indicated, shall be understood to have the
following meanings:

A "nucleic acid molecule" of this invention refers to a polymeric form of 
nucleotides and includes both sense and antisense strands of RNA, cDNA, genomic DNA, 
and synthetic forms and mixed polymers of the above. A nucleotide refers to a
ribonucleotide, deoxynucleotide or a modified form of either type of nucleotide. A
"nucleic acid molecule" as used herein is synonymous with "nucleic acid" and
"polynucleotide." The term "nucleic acid molecule" usually refers to a molecule of at
least 10 bases in length, unless otherwise specified. The term includes single- and
double-stranded forms of DNA. In addition, a polynucleotide may include either or both
naturally occurring and modified nucleotides linked together by naturally occurring and/or
non-naturally occurring nucleotide linkages.

Nucleotides are represented by single letter symbols in nucleic acid molecule 
sequences. The following table lists symbols identifying nucleotides or groups of
nucleotides which may occupy the symbol position on a nucleic acid molecule. See
Nomenclature Committee of the International Union of Biochemistry (NC-IUB),
Nomenclature for incompletely specified bases in nucleic acid sequences,
Recommendations 1984, Eur. J. Biochem. 150(1):1-5 (1985).



Symbol  Meaning                                 Group/Origin of Designation     Complementary Symbol
a       a                                       Adenine                         t/u
g       g                                       Guanine                         c
c       c                                       Cytosine                        g
t       t                                       Thymine                         a
u       u                                       Uracil                          a
r       g or a                                  puRine                          y
y       t/u or c                                pYrimidine                      r
m       a or c                                  aMino                           k
k       g or t/u                                Keto                            m
s       g or c                                  Strong interactions 3H-bonds    s
w       a or t/u                                Weak interactions 2H-bonds      w
b       g or c or t/u                           not a                           v
d       a or g or t/u                           not c                           h
h       a or c or t/u                           not g                           d
v       a or g or c                             not t, not u                    b
n       a or g or c or t/u, unknown, or other   aNy                             n
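
By way of illustration only, the symbol table above can be expressed as a small lookup from which the complementary symbol is derived rather than typed in by hand. The following is a minimal sketch, not part of the NC-IUB recommendation itself; the names IUPAC, BASE_COMPLEMENT, complement_symbol and reverse_complement are hypothetical, and "t" is assumed to stand for t/u.

```python
# Base sets for each single-letter symbol; "t" stands for t/u (assumption).
IUPAC = {
    "a": "a", "c": "c", "g": "g", "t": "t",
    "r": "ag", "y": "ct", "m": "ac", "k": "gt",
    "s": "cg", "w": "at",
    "b": "cgt", "d": "agt", "h": "act", "v": "acg",
    "n": "acgt",
}
BASE_COMPLEMENT = {"a": "t", "c": "g", "g": "c", "t": "a"}


def complement_symbol(symbol: str) -> str:
    """Return the symbol whose base set is the complement of `symbol`'s base set."""
    s = symbol.lower()
    if s == "u":  # uracil pairs like thymine
        s = "t"
    target = frozenset(BASE_COMPLEMENT[b] for b in IUPAC[s])
    for candidate, bases in IUPAC.items():
        if frozenset(bases) == target:
            return candidate
    raise ValueError(f"no complementary symbol for {symbol!r}")


def reverse_complement(seq: str) -> str:
    """Complement each symbol and reverse, giving the other strand read 5' to 3'."""
    return "".join(complement_symbol(ch) for ch in reversed(seq))
```

Because the complement is computed from the base sets, the symmetric groups s and w map to themselves, while r/y, m/k, b/v and d/h pair with each other.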




The nucleic acid molecules may be modified chemically or biochemically or may
contain non-natural or derivatized nucleotide bases, as will be readily appreciated by those
of skill in the art. Such modifications include, for example, labels, methylation,
substitution of one or more of the naturally occurring nucleotides with an analog,
internucleotide modifications such as uncharged linkages (e.g., methyl phosphonates,
phosphotriesters, phosphoramidates, carbamates, etc.), charged linkages (e.g.,
phosphorothioates, phosphorodithioates, etc.), pendent moieties (e.g., polypeptides),
intercalators (e.g., acridine, psoralen, etc.), chelators, alkylators, and modified linkages
(e.g., alpha anomeric nucleic acids, etc.). The term "nucleic acid molecule" also includes
any topological conformation, including single-stranded, double-stranded, partially
duplexed, triplexed, hairpinned, circular and padlocked conformations. Also included are
synthetic molecules that mimic polynucleotides in their ability to bind to a designated
sequence via hydrogen bonding and other chemical interactions. Such molecules are
known in the art and include, for example, those in which peptide linkages substitute for
phosphate linkages in the backbone of the molecule.

A "gene" is defined as a nucleic acid molecule that comprises a nucleic acid
sequence that encodes a polypeptide and the expression control sequences that surround
the nucleic acid sequence that encodes the polypeptide. For instance, a gene may
comprise a promoter, one or more enhancers, a nucleic acid sequence that encodes a
polypeptide, downstream regulatory sequences and, possibly, other nucleic acid sequences
involved in regulation of the expression of an RNA. As is well known in the art,
eukaryotic genes usually contain both exons and introns. The term "exon" refers to a
nucleic acid sequence found in genomic DNA that is bioinformatically predicted and/or
experimentally confirmed to contribute contiguous sequence to a mature mRNA
transcript. The term "intron" refers to a nucleic acid sequence found in genomic DNA that
is predicted and/or confirmed to not contribute to a mature mRNA transcript, but rather to
be "spliced out" during processing of the transcript.

A nucleic acid molecule or polypeptide is "derived" from a particular species if the 
nucleic acid molecule or polypeptide has been isolated from the particular species, or if the
nucleic acid molecule or polypeptide is homologous to a nucleic acid molecule or
polypeptide isolated from a particular species. 

An "isolated" or "substantially pure" nucleic acid or polynucleotide (e.g., an RNA, 
DNA or a mixed polymer) is one which is substantially separated from other cellular 
components that naturally accompany the native polynucleotide in its natural host cell,
e.g., ribosomes, polymerases, or genomic sequences with which it is naturally associated.
The term embraces a nucleic acid or polynucleotide that (1) has been removed from its
naturally occurring environment, (2) is not associated with all or a portion of a
polynucleotide in which the "isolated polynucleotide" is found in nature, (3) is operatively
linked to a polynucleotide which it is not linked to in nature, (4) does not occur in nature
as part of a larger sequence or (5) includes nucleotides or internucleoside bonds that are
not found in nature. The term "isolated" or "substantially pure" also can be used in
reference to recombinant or cloned DNA isolates, chemically synthesized polynucleotide
analogs, or polynucleotide analogs that are biologically synthesized by heterologous
systems. The term "isolated nucleic acid molecule" includes nucleic acid molecules that
are integrated into a host cell chromosome at a heterologous site, recombinant fusions of a
native fragment to a heterologous sequence, recombinant vectors present as episomes or as
integrated into a host cell chromosome.

A "part" of a nucleic acid molecule refers to a nucleic acid molecule that
comprises a partial contiguous sequence of at least 10 bases of the reference nucleic acid 
molecule. Preferably, a part comprises at least 15 to 20 bases of a reference nucleic acid 
molecule. In theory, a nucleic acid sequence of 17 nucleotides is of sufficient length to
occur at random less frequently than once in the three gigabase human genome, and thus
provides a nucleic acid probe that can uniquely identify the reference sequence in a
nucleic acid mixture of genomic complexity. A preferred part is one that comprises a
nucleic acid sequence that can encode at least 6 contiguous amino acids
(fragments of at least 18 nucleotides) because such parts are useful in directing the
expression or synthesis of peptides that are useful in mapping the epitopes of the
polypeptide encoded by the reference nucleic acid. See, e.g., Geysen et al., Proc. Natl.
Acad. Sci. USA 81:3998-4002 (1984); and U.S. Patent Nos. 4,708,871 and 5,595,915, the
disclosures of which are incorporated herein by reference in their entireties. A part may
also comprise at least 25, 30, 35 or 40 nucleotides of a reference nucleic acid molecule, or
at least 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400 or 500 nucleotides of a
reference nucleic acid molecule. A part of a nucleic acid molecule may comprise no other
nucleic acid sequences. Alternatively, a part of a nucleic acid may comprise other nucleic
acid sequences from other nucleic acid molecules.
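
The 17-nucleotide figure given above can be checked with a short calculation. The sketch below is illustrative only and assumes that the four bases occur independently and with equal frequency, which real genomes do not strictly satisfy; the function names and the both_strands option are hypothetical.

```python
def expected_random_hits(k: int, genome_size: float = 3e9,
                         both_strands: bool = False) -> float:
    """Expected number of chance occurrences of one specific k-mer.

    Assumes uniform, independent bases, so any given k-mer has
    probability 1 / 4**k at each position.
    """
    positions = genome_size * (2 if both_strands else 1)
    return positions / 4 ** k


def min_unique_length(genome_size: float = 3e9,
                      both_strands: bool = False) -> int:
    """Smallest k whose expected number of chance occurrences is below one."""
    k = 1
    while expected_random_hits(k, genome_size, both_strands) >= 1:
        k += 1
    return k
```

Under these assumptions there are 4^17 (about 1.7 x 10^10) possible 17-mers, which exceeds the roughly 3 x 10^9 positions in the genome, so a given 17-mer is expected to occur by chance less than once; counting both strands, 17 is the shortest length for which this holds.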

The term "oligonucleotide" refers to a nucleic acid molecule generally comprising
a length of 200 bases or fewer. The term often refers to single-stranded
deoxyribonucleotides, but it can refer as well to single- or double-stranded ribonucleotides,
RNA:DNA hybrids and double-stranded DNAs, among others. Preferably,
oligonucleotides are 10 to 60 bases in length and most preferably 12, 13, 14, 15, 16, 17,
18, 19 or 20 bases in length. Other preferred oligonucleotides are 25, 30, 35, 40, 45, 50,
55 or 60 bases in length. Oligonucleotides may be single-stranded, e.g. for use as probes
or primers, or may be double-stranded, e.g. for use in the construction of a mutant gene. 
Oligonucleotides of the invention can be either sense or antisense oligonucleotides. An 
oligonucleotide can be derivatized or modified as discussed above for nucleic acid 
molecules. 

Oligonucleotides, such as single-stranded DNA probe oligonucleotides, often are
synthesized by chemical methods, such as those implemented on automated
oligonucleotide synthesizers. However, oligonucleotides can be made by a variety of
other methods, including in vitro recombinant DNA-mediated techniques and by
expression of DNAs in cells and organisms. Initially, chemically synthesized DNAs
typically are obtained without a 5' phosphate. The 5' ends of such oligonucleotides are
not substrates for phosphodiester bond formation by ligation reactions that employ DNA
ligases typically used to form recombinant DNA molecules. Where ligation of such
oligonucleotides is desired, a phosphate can be added by standard techniques, such as
those that employ a kinase and ATP. The 3' end of a chemically synthesized
oligonucleotide generally has a free hydroxyl group and, in the presence of a ligase, such
as T4 DNA ligase, readily will form a phosphodiester bond with a 5' phosphate of another
polynucleotide, such as another oligonucleotide. As is well known, this reaction can be
prevented selectively, where desired, by removing the 5' phosphates of the other
polynucleotide(s) prior to ligation.

The term "naturally occurring nucleotide" referred to herein includes naturally
occurring deoxyribonucleotides and ribonucleotides. The term "modified nucleotides"
referred to herein includes nucleotides with modified or substituted sugar groups and the
like. The term "nucleotide linkages" referred to herein includes nucleotide linkages such
as phosphorothioate, phosphorodithioate, phosphoroselenoate, phosphorodiselenoate,
phosphoroanilothioate, phosphoraniladate, phosphoroamidate, and the like. See, e.g.,
LaPlanche et al., Nucl. Acids Res. 14:9081-9093 (1986); Stein et al., Nucl. Acids Res.
16:3209-3221 (1988); Zon et al., Anti-Cancer Drug Design 6:539-568 (1991); Zon et al.,
in Eckstein (ed.) Oligonucleotides and Analogues: A Practical Approach, pp. 87-108,
Oxford University Press (1991); Uhlmann and Peyman, Chemical Reviews 90:543 (1990),
and U.S. Patent No. 5,151,510, the disclosure of which is hereby incorporated by
reference in its entirety.

Unless specified otherwise, the left hand end of a polynucleotide sequence in sense
orientation is the 5' end and the right hand end of the sequence is the 3' end. In addition,
the left hand direction of a polynucleotide sequence in sense orientation is referred to as
the 5' direction, while the right hand direction of the polynucleotide sequence is referred
to as the 3' direction. Further, unless otherwise indicated, each nucleotide sequence is set
forth herein as a sequence of deoxyribonucleotides. It is intended, however, that the given
sequence be interpreted as would be appropriate to the polynucleotide composition: for
example, if the isolated nucleic acid is composed of RNA, the given sequence intends
ribonucleotides, with uridine substituted for thymidine.

The term "allelic variant" refers to one of two or more alternative naturally
occurring forms of a gene, wherein each gene possesses a unique nucleotide sequence. In 
a preferred embodiment, different alleles of a given gene have similar or identical 
biological properties. 

The term "percent sequence identity" in the context of nucleic acid sequences
refers to the residues in two sequences which are the same when aligned for maximum
correspondence. The length of sequence identity comparison may be over a stretch of at
least about nine nucleotides, usually at least about 20 nucleotides, more usually at least
about 24 nucleotides, typically at least about 28 nucleotides, more typically at least about
32 nucleotides, and preferably at least about 36 or more nucleotides. There are a number
of different algorithms known in the art which can be used to measure nucleotide sequence
identity. For instance, polynucleotide sequences can be compared using FASTA, Gap or
Bestfit, which are programs in Wisconsin Package Version 10.0, Genetics Computer
Group (GCG), Madison, Wisconsin. FASTA, which includes, e.g., the programs FASTA2
and FASTA3, provides alignments and percent sequence identity of the regions of the best
overlap between the query and search sequences (Pearson, Methods Enzymol. 183: 63-98
(1990); Pearson, Methods Mol. Biol. 132: 185-219 (2000); Pearson, Methods Enzymol.
266: 227-258 (1996); Pearson, J. Mol. Biol. 276: 71-84 (1998)). Unless otherwise
specified, default parameters for a particular program or algorithm are used. For instance,
percent sequence identity between nucleic acid sequences can be determined using
FASTA with its default parameters (a word size of 6 and the NOPAM factor for the
scoring matrix) or using Gap with its default parameters as provided in GCG Version 6.1.
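
For illustration, once two sequences have been aligned, percent sequence identity reduces to counting matching columns. The sketch below is not the FASTA, Gap or Bestfit implementation; it assumes the alignment has already been produced by such a program, that gaps are written as "-", and that a column with a gap in one sequence counts as a mismatch (programs differ on this point). The function name is hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of aligned columns in which both sequences carry the same residue.

    Columns that are gaps in both sequences are ignored; a column with a gap
    in only one sequence is counted as a mismatch (an assumption).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.lower(), aligned_b.lower()):
        if x == gap and y == gap:
            continue
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared
```

For example, two aligned 8-mers differing at one position give 7 matches in 8 columns, or 87.5% identity.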

A reference to a nucleic acid sequence encompasses its complement unless 
otherwise specified. Thus, a reference to a nucleic acid molecule having a particular 
sequence should be understood to encompass its complementary strand, with its
complementary sequence. The complementary strand is also useful, e.g., for antisense
therapy, double-stranded RNA (dsRNA) inhibition (RNAi), combination of triplex and 
antisense, hybridization probes and PCR primers. 

In the molecular biology art, researchers use the terms "percent sequence identity", 
"percent sequence similarity" and "percent sequence homology" interchangeably. In this
application, these terms shall have the same meaning with respect to nucleic acid
sequences only.

The term "substantial similarity" or "substantial sequence similarity," when
referring to a nucleic acid or fragment thereof, indicates that, when optimally aligned with 
appropriate nucleotide insertions or deletions with another nucleic acid (or its 
complementary strand), there is nucleotide sequence identity in at least about 50%, more 
preferably 60% of the nucleotide bases, usually at least about 70%, more usually at least
about 80%, preferably at least about 90%, and more preferably at least about 95-98% of 
the nucleotide bases, as measured by any well known algorithm of sequence identity, such 
as FASTA, BLAST or Gap, as discussed above. 

Alternatively, substantial similarity exists between a first and second nucleic acid
sequence when the first nucleic acid sequence or fragment thereof hybridizes to an
antisense strand of the second nucleic acid, under selective hybridization conditions.
Typically, selective hybridization will occur between the first nucleic acid sequence and
an antisense strand of the second nucleic acid sequence when there is at least about 55%
sequence identity between the first and second nucleic acid sequences (preferably at least
about 65%, more preferably at least about 75%, and most preferably at least about 90%)
over a stretch of at least about 14 nucleotides, more preferably at least 17 nucleotides, 
even more preferably at least 20, 25, 30, 35, 40, 50, 60, 70, 80, 90 or 100 nucleotides. 

Nucleic acid hybridization will be affected by such conditions as salt
concentration, temperature, solvents, the base composition of the hybridizing species,
length of the complementary regions, and the number of nucleotide base mismatches
between the hybridizing nucleic acids, as will be readily appreciated by those skilled in the
art. "Stringent hybridization conditions" and "stringent wash conditions" in the context
of nucleic acid hybridization experiments depend upon a number of different physical
parameters. The most important parameters include temperature of hybridization, base
composition of the nucleic acids, salt concentration and length of the nucleic acid. One
having ordinary skill in the art knows how to vary these parameters to achieve a particular
stringency of hybridization. In general, "stringent hybridization" is performed at about
25°C below the thermal melting point (T_m) for the specific DNA hybrid under a particular
set of conditions. "Stringent washing" is performed at temperatures about 5°C lower than
the T_m for the specific DNA hybrid under a particular set of conditions. The T_m is the
temperature at which 50% of the target sequence hybridizes to a perfectly matched probe.
See Sambrook (1989), supra, p. 9.51.

The T_m for a particular DNA-DNA hybrid can be estimated by the formula:

    T_m = 81.5°C + 16.6 (log10[Na+]) + 0.41 (fraction G + C)
          - 0.63 (% formamide) - (600/l)

where l is the length of the hybrid in base pairs.

The T_m for a particular RNA-RNA hybrid can be estimated by the formula:

    T_m = 79.8°C + 18.5 (log10[Na+]) + 0.58 (fraction G + C)
          + 11.8 (fraction G + C)^2 - 0.35 (% formamide) - (820/l).

The T_m for a particular RNA-DNA hybrid can be estimated by the formula:

    T_m = 79.8°C + 18.5 (log10[Na+]) + 0.58 (fraction G + C)
          + 11.8 (fraction G + C)^2 - 0.50 (% formamide) - (820/l).
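
As a worked illustration, the three estimates above can be evaluated directly. The sketch below is a literal transcription of the formulas as written in this specification, with the G + C term passed as a fraction exactly as the text states; if the cited reference applies a coefficient to G + C expressed as a percentage, the argument would need to be scaled accordingly. The function and parameter names are hypothetical.

```python
import math


def tm_dna_dna(na_molar: float, gc: float, formamide_pct: float, length_bp: int) -> float:
    """T_m estimate (°C) for a DNA-DNA hybrid, as written in the text above."""
    return (81.5 + 16.6 * math.log10(na_molar) + 0.41 * gc
            - 0.63 * formamide_pct - 600.0 / length_bp)


def _tm_rna(na_molar: float, gc: float, formamide_pct: float,
            length_bp: int, formamide_coeff: float) -> float:
    # Shared form of the RNA-RNA and RNA-DNA estimates; only the
    # formamide coefficient differs between the two.
    return (79.8 + 18.5 * math.log10(na_molar) + 0.58 * gc + 11.8 * gc ** 2
            - formamide_coeff * formamide_pct - 820.0 / length_bp)


def tm_rna_rna(na_molar: float, gc: float, formamide_pct: float, length_bp: int) -> float:
    """T_m estimate (°C) for an RNA-RNA hybrid (formamide coefficient 0.35)."""
    return _tm_rna(na_molar, gc, formamide_pct, length_bp, 0.35)


def tm_rna_dna(na_molar: float, gc: float, formamide_pct: float, length_bp: int) -> float:
    """T_m estimate (°C) for an RNA-DNA hybrid (formamide coefficient 0.50)."""
    return _tm_rna(na_molar, gc, formamide_pct, length_bp, 0.50)
```

With all other inputs equal, the RNA-RNA and RNA-DNA estimates differ only through the formamide term, by 0.15°C for each 1% formamide.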

In general, the T_m decreases by 1-1.5°C for each 1% of mismatch between two
nucleic acid sequences. Thus, one having ordinary skill in the art can alter hybridization
and/or washing conditions to obtain sequences that have higher or lower degrees of
sequence identity to the target nucleic acid. For instance, to obtain hybridizing nucleic
acids that contain up to 10% mismatch from the target nucleic acid sequence, 10-15°C
would be subtracted from the calculated T_m of a perfectly matched hybrid, and then the
hybridization and washing temperatures adjusted accordingly. Probe sequences may also
hybridize specifically to duplex DNA under certain conditions to form triplex or other
higher order DNA complexes. The preparation of such probes and suitable hybridization
conditions are well known in the art.
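
The mismatch adjustment described above amounts to simple arithmetic, sketched below for illustration. The 1-1.5°C per 1% mismatch range and the offsets of about 25°C (hybridization) and about 5°C (washing) below T_m are taken from the text; the function names are hypothetical.

```python
def tm_range_with_mismatch(tm_perfect: float, mismatch_pct: float,
                           low_per_pct: float = 1.0,
                           high_per_pct: float = 1.5) -> tuple:
    """Return (lowest, highest) expected T_m for a hybrid with the given % mismatch."""
    return (tm_perfect - high_per_pct * mismatch_pct,
            tm_perfect - low_per_pct * mismatch_pct)


def stringent_temperatures(tm: float) -> tuple:
    """Return (hybridization, wash) temperatures: about 25°C and 5°C below T_m."""
    return (tm - 25.0, tm - 5.0)
```

For a perfectly matched hybrid with a calculated T_m of 80°C, allowing up to 10% mismatch lowers the working T_m to between 65°C and 70°C, and the hybridization and wash temperatures are then set relative to that adjusted value.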

An example of stringent hybridization conditions for hybridization of
complementary nucleic acid sequences having more than 100 complementary residues on
a filter in a Southern or Northern blot or for screening a library is 50% formamide/6X SSC
at 42°C for at least ten hours and preferably overnight (approximately 16 hours). Another
example of stringent hybridization conditions is 6X SSC at 68°C without formamide for at
least ten hours and preferably overnight. An example of moderate stringency
hybridization conditions is 6X SSC at 55°C without formamide for at least ten hours and
preferably overnight. An example of low stringency hybridization conditions for
hybridization of complementary nucleic acid sequences having more than 100
complementary residues on a filter in a Southern or Northern blot or for screening a library
is 6X SSC at 42°C for at least ten hours. Hybridization conditions to identify nucleic acid
sequences that are similar but not identical can be identified by experimentally changing
the hybridization temperature from 68°C to 42°C while keeping the salt concentration
constant (6X SSC), or keeping the hybridization temperature and salt concentration
constant (e.g. 42°C and 6X SSC) and varying the formamide concentration from 50% to
0%. Hybridization buffers may also include blocking agents to lower background. These
agents are well known in the art. See Sambrook et al. (1989), supra, pages 8.46 and
9.46-9.58. See also Ausubel (1992), supra, Ausubel (1999), supra, and Sambrook (2001),
supra.

Wash conditions also can be altered to change stringency conditions. An example
of stringent wash conditions is a 0.2x SSC wash at 65°C for 15 minutes (see Sambrook
(1989), supra, for SSC buffer). Often the high stringency wash is preceded by a low
stringency wash to remove excess probe. An exemplary medium stringency wash for
duplex DNA of more than 100 base pairs is 1x SSC at 45°C for 15 minutes. An
exemplary low stringency wash for such a duplex is 4x SSC at 40°C for 15 minutes. In
general, a signal-to-noise ratio of 2x or higher relative to that observed for an unrelated
probe in the particular hybridization assay indicates detection of a specific hybridization.

As defined herein, nucleic acids that do not hybridize to each other under stringent 
conditions are still substantially similar to one another if they encode polypeptides that are
substantially identical to each other. This occurs, for example, when a nucleic acid is
created synthetically or recombinantly using a high codon degeneracy as permitted by the
redundancy of the genetic code. 

Hybridization conditions for nucleic acid molecules that are shorter than 100
nucleotides in length (e.g., for oligonucleotide probes) may be calculated by the formula:

    T_m = 81.5°C + 16.6 (log10[Na+]) + 0.41 (fraction G + C) - (600/N)

wherein N is chain length and the [Na+] is 1 M or less. See Sambrook (1989), supra,
p. 11.46. For hybridization of probes shorter than 100 nucleotides, hybridization is usually
performed under stringent conditions (5-10°C below the T_m) using high concentrations
(0.1-1.0 pmol/ml) of probe. Id. at p. 11.45. Determination of hybridization using mismatched
probes, pools of degenerate probes or "guessmers," as well as hybridization solutions and
methods for empirically determining hybridization conditions are well known in the art.
See, e.g., Ausubel (1999), supra; Sambrook (1989), supra, pp. 11.45-11.57.
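
For illustration, the short-probe estimate above can be computed from a probe sequence as follows. This sketch transcribes the formula as written here, with G + C taken as a fraction; the units of that term should be checked against the cited reference before the values are relied upon. The function names are hypothetical, and the 5-10°C window is the one stated in the text.

```python
import math


def gc_fraction(seq: str) -> float:
    """Fraction of positions in `seq` that are g or c."""
    s = seq.lower()
    return (s.count("g") + s.count("c")) / len(s)


def tm_oligo(seq: str, na_molar: float = 1.0) -> float:
    """T_m estimate (°C) for a probe shorter than 100 nt, as written in the text."""
    if not 0 < na_molar <= 1.0:
        raise ValueError("the formula is stated for [Na+] of 1 M or less")
    n = len(seq)  # chain length N
    return 81.5 + 16.6 * math.log10(na_molar) + 0.41 * gc_fraction(seq) - 600.0 / n


def stringent_window(tm: float) -> tuple:
    """Hybridization temperature range 5-10°C below T_m, returned as (low, high)."""
    return (tm - 10.0, tm - 5.0)
```

The 600/N term dominates for short probes: at N = 20 it lowers the estimate by 30°C, whereas at N = 100 it lowers it by only 6°C.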

The term "digestion" or "digestion of DNA" refers to catalytic cleavage of the
DNA with a restriction enzyme that acts only at certain sequences in the DNA. The
various restriction enzymes referred to herein are commercially available and their
reaction conditions, cofactors and other requirements for use are known and routine to the
skilled artisan. For analytical purposes, typically, 1 µg of plasmid or DNA fragment is
digested with about 2 units of enzyme in about 20 µl of reaction buffer. For the purpose of
isolating DNA fragments for plasmid construction, typically 5 to 50 µg of DNA are
digested with 20 to 250 units of enzyme in proportionately larger volumes. Appropriate
buffers and substrate amounts for particular restriction enzymes are described in standard
laboratory manuals, such as those referenced below, and are specified by commercial
suppliers. Incubation times of about 1 hour at 37°C are ordinarily used, but conditions
may vary in accordance with standard procedures, the supplier's instructions and the
particulars of the reaction. After digestion, reactions may be analyzed, and fragments may
be purified by electrophoresis through an agarose or polyacrylamide gel, using well
known methods that are routine for those skilled in the art.

The term "ligation" refers to the process of forming phosphodiester bonds between
two or more polynucleotides, which most often are double-stranded DNAs. Techniques
for ligation are well known to the art and protocols for ligation are described in standard
laboratory manuals and references, such as, e.g., Sambrook (1989), supra.
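
The definitions of digestion and ligation above can be pictured with a simple string model. The sketch below is illustrative only: it treats a DNA molecule as a single top-strand string, ignores overhangs and 5' phosphates, and uses the EcoRI recognition sequence (gaattc, cut after the first g) purely as an example enzyme that is not named in this specification. The function names are hypothetical.

```python
def digest(seq: str, site: str = "gaattc", cut_offset: int = 1) -> list:
    """Cut `seq` at every occurrence of `site`, `cut_offset` bases into the site.

    Models the top strand only; returns fragments in 5' to 3' order.
    """
    seq = seq.lower()
    fragments = []
    start = 0
    pos = seq.find(site)
    while pos != -1:
        cut = pos + cut_offset
        fragments.append(seq[start:cut])
        start = cut
        pos = seq.find(site, pos + 1)
    fragments.append(seq[start:])
    return fragments


def ligate(fragments: list) -> str:
    """Join fragments end to end, standing in for phosphodiester bond formation."""
    return "".join(fragments)
```

In this model, ligating the fragments of a digest in their original order restores the starting sequence, which mirrors the complementary relationship between the two operations.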

Genome-derived "single exon probes" are probes that comprise at least part of an
exon ("reference exon") and can hybridize detectably under high stringency conditions to
transcript-derived nucleic acids that include the reference exon but do not hybridize
detectably under high stringency conditions to nucleic acids that lack the reference exon.
Single exon probes typically further comprise, contiguous to a first end of the exon
portion, a first intronic and/or intergenic sequence that is identically contiguous to the
exon in the genome, and may contain a second intronic and/or intergenic sequence that is
identically contiguous to the exon in the genome. The minimum length of genome-derived
single exon probes is defined by the requirement that the exonic portion be of
sufficient length to hybridize under high stringency conditions to transcript-derived
nucleic acids, as discussed above. The maximum length of genome-derived single exon
probes is defined by the requirement that the probes contain portions of no more than one
exon. The single exon probes may contain priming sequences not found in contiguity with
the rest of the probe sequence in the genome, which priming sequences are useful for PCR
and other amplification-based technologies. In another aspect, the invention is directed to
single exon probes based on the CSNAs disclosed herein.

In one embodiment, the term "microarray" refers to a "nucleic acid microarray"
having a substrate-bound plurality of nucleic acids, hybridization to each of the plurality 
of bound nucleic acids being separately detectable. The substrate can be solid or porous, 
planar or non-planar, unitary or distributed. Nucleic acid microarrays include all the 
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devices so called in Schena (ed.), DNA Microarravs: A Practical Ap proach (Practical 
Approach Series^ Oxford University Press (1999); Nature Genet 21(l)(suppl.):l - 60 

(1999) ; Schena (ed.), Microarrav Biochin: Tools and Technology. Eaton Publishing 
Company/BioTechniques Books Division (2000). Additionally, these nucleic acid 

5 microarrays include a substrate-bound plurality of nucleic acids in which the plurality of 
nucleic acids are disposed on a plurality of beads, rather than on a unitary planar substrate, 
as is described, inter alia, in Brenner et aL, Proc. Natl Acad. Set USA 97(4): 1665- 1670 

(2000) . Examples of nucleic acid microarrays may be found in U.S. Patent Nos. 
6,391,623, 6,383,754, 6,383,749, 6,380,377, 6,379,897, 6,376,191, 6,372,431, 6,351,712 

10 6,344,316, 6,316,193, 6,312,906, 6,309,828, 6,309,824, 6,306,643, 6,300,063, 6,287,850, 
6,284,497, 6,284,465, 6,280,954, 6,262,216, 6,251,601, 6,245,518, 6,263,287, 6,251,601, 
6,238,866, 6,228,575, 6,214,587, 6,203,989, 6,171,797, 6,103,474, 6,083,726, 6,054,274, 
6,040,138, 6,083,726, 6,004,755, 6,001,309, 5,958,342, 5,952,180, 5,936,731, 5,843,655, 
5,814,454, 5,837,196, 5,436,327, 5,412,087, and 5,405,783, the disclosures of which are 

incorporated herein by reference in their entireties.

In an alternative embodiment, a "microarray" may also refer to a "peptide 
microarray" or "protein microarray" having a substrate-bound collection or plurality of 
polypeptides, the binding to each of the plurality of bound polypeptides being separately 
detectable. Alternatively, the peptide microarray may have a plurality of binders,
including but not limited to monoclonal antibodies, polyclonal antibodies, phage display
binders, yeast two-hybrid binders, and aptamers, which can specifically detect the binding of
the polypeptides of this invention. The array may be based on autoantibody detection to
the polypeptides of this invention, see Robinson et al., Nature Medicine 8(3):295-301
(2002). Examples of peptide arrays may be found in WO 02/31463, WO 02/25288, WO
01/94946, WO 01/88162, WO 01/68671, WO 01/57259, WO 00/61806, WO 00/54046,
WO 00/47774, WO 99/40434, WO 99/39210, and WO 97/42507 and U.S. Patent Nos. 
6,268,210, 5,766,960, and 5,143,854, the disclosures of which are incorporated herein by 
reference in their entireties. 

In addition, determination of the levels of the CSNA or CSP may be made in a
multiplex manner using techniques described in WO 02/29109, WO 02/24959, WO
01/83502, WO 01/73113, WO 01/59432, WO 01/57269, and WO 99/67641, the
disclosures of which are incorporated herein by reference in their entireties.

The term "mutant", "mutated", or "mutation" when applied to nucleic acid
sequences means that nucleotides in a nucleic acid sequence may be inserted, deleted or
changed compared to a reference nucleic acid sequence. A single alteration may be made
at a locus (a point mutation) or multiple nucleotides may be inserted, deleted or changed at
a single locus. In addition, one or more alterations may be made at any number of loci
within a nucleic acid sequence. In a preferred embodiment of the present invention, the 
nucleic acid sequence is the wild type nucleic acid sequence encoding a CSP or is a 
CSNA. The nucleic acid sequence may be mutated by any method known in the art 
including those mutagenesis techniques described infra. 

The term "error-prone PCR" refers to a process for performing PCR under
conditions where the copying fidelity of the DNA polymerase is low, such that a high rate
of point mutations is obtained along the entire length of the PCR product. See, e.g., Leung
et al., Technique 1:11-15 (1989) and Caldwell et al., PCR Methods Applic. 2:28-33
(1992).

The term "oligonucleotide-directed mutagenesis" refers to a process which enables
the generation of site-specific mutations in any cloned DNA segment of interest. See, e.g.,
Reidhaar-Olson et al., Science 241:53-57 (1988).

The term "assembly PCR" refers to a process which involves the assembly of a
PCR product from a mixture of small DNA fragments. A large number of different PCR
reactions occur in parallel in the same vial, with the products of one reaction priming the
products of another reaction.

The term "sexual PCR mutagenesis" or "DNA shuffling" refers to a method of
error-prone PCR coupled with forced homologous recombination between DNA
molecules of different but highly related DNA sequence in vitro, caused by random
fragmentation of the DNA molecule based on sequence similarity, followed by fixation of
the crossover by primer extension in an error-prone PCR reaction. See, e.g., Stemmer,
Proc. Natl. Acad. Sci. U.S.A. 91:10747-10751 (1994). DNA shuffling can be carried out
between several related genes ("family shuffling").

The term "in vivo mutagenesis" refers to a process of generating random mutations
in any cloned DNA of interest which involves the propagation of the DNA in a strain of
bacteria such as E. coli that carries mutations in one or more of the DNA repair pathways.
These "mutator" strains have a higher random mutation rate than that of a wild-type
parent. Propagating the DNA in a mutator strain will eventually generate random
mutations within the DNA.

The term "cassette mutagenesis" refers to any process for replacing a small region 
of a double-stranded DNA molecule with a synthetic oligonucleotide "cassette" that 
differs from the native sequence. The oligonucleotide often contains completely and/or
partially randomized native sequence. 

The term "recursive ensemble mutagenesis" refers to an algorithm for protein 
engineering (protein mutagenesis) developed to produce diverse populations of 
phenotypically related mutants whose members differ in amino acid sequence. This 
method uses a feedback mechanism to control successive rounds of combinatorial cassette
mutagenesis. See, e.g., Arkin et al., Proc. Natl. Acad. Sci. U.S.A. 89:7811-7815 (1992).

The term "exponential ensemble mutagenesis" refers to a process for generating 
combinatorial libraries with a high percentage of unique and functional mutants, wherein 
small groups of residues are randomized in parallel to identify, at each altered position, 
amino acids which lead to functional proteins. See, e.g., Delegrave et al., Biotechnology
Research 11:1548-1552 (1993); Arnold, Current Opinion in Biotechnology 4:450-455
(1993).

"Operatively linked" expression control sequences refers to a linkage in which the 
expression control sequence is either contiguous with the gene of interest to control the 
gene of interest, or acts in trans or at a distance to control the gene of interest.

The term "expression control sequence" as used herein refers to polynucleotide 
sequences which are necessary to affect the expression of coding sequences to which they 
are operatively linked. Expression control sequences are sequences which control the 
transcription, post-transcriptional events and translation of nucleic acid sequences. 
Expression control sequences include appropriate transcription initiation, termination,
promoter and enhancer sequences; efficient RNA processing signals such as splicing and
polyadenylation signals; sequences that stabilize cytoplasmic mRNA; sequences that
enhance translation efficiency (e.g., ribosome binding sites); sequences that enhance
protein stability; and when desired, sequences that enhance protein secretion. The nature
of such control sequences differs depending upon the host organism; in prokaryotes, such
control sequences generally include promoter, ribosomal binding site, and transcription 
termination sequence. The term "control sequences" is intended to include, at a minimum, 
all components whose presence is essential for expression, and can also include additional 
components whose presence is advantageous, for example, leader sequences and fusion
partner sequences. 

The term "vector," as used herein, is intended to refer to a nucleic acid molecule 
capable of transporting another nucleic acid to which it has been linked. One type of
vector is a "plasmid", which refers to a circular double-stranded DNA loop into which
additional DNA segments may be ligated. Other vectors include cosmids, bacterial
artificial chromosomes (BAC) and yeast artificial chromosomes (YAC). Another type of
vector is a viral vector, wherein additional DNA segments may be ligated into the viral
genome. Viral vectors that infect bacterial cells are referred to as bacteriophages. Certain
vectors are capable of autonomous replication in a host cell into which they are introduced
(e.g., bacterial vectors having a bacterial origin of replication). Other vectors can be
integrated into the genome of a host cell upon introduction into the host cell, and thereby
are replicated along with the host genome. Moreover, certain vectors are capable of
directing the expression of genes to which they are operatively linked. Such vectors are
referred to herein as "recombinant expression vectors" (or simply, "expression vectors").
In general, expression vectors of utility in recombinant DNA techniques are often in the
form of plasmids. In the present specification, "plasmid" and "vector" may be used
interchangeably as the plasmid is the most commonly used form of vector. However, the
invention is intended to include other forms of expression vectors that serve equivalent
functions.

The term "recombinant host cell" (or simply "host cell"), as used herein, is
intended to refer to a cell into which a recombinant expression vector has been introduced.
It should be understood that such terms are intended to refer not only to the particular
subject cell but to the progeny of such a cell. Because certain modifications may occur in
succeeding generations due to either mutation or environmental influences, such progeny
may not, in fact, be identical to the parent cell, but are still included within the scope of
the term "host cell" as used herein.

As used herein, the phrase "open reading frame" and the equivalent acronym
"ORF" refer to that portion of a transcript-derived nucleic acid that can be translated in its
entirety into a sequence of contiguous amino acids. As so defined, an ORF has length,
measured in nucleotides, exactly divisible by 3. As so defined, an ORF need not encode
the entirety of a natural protein.
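
The ORF definition above can be illustrated with a short check. The following Python sketch is illustrative only and is not part of the disclosure: the helper name `is_orf` is hypothetical, and it assumes that "translated in its entirety" means the sequence consists only of A, C, G and T (or U), has a length exactly divisible by 3, and contains no in-frame stop codon of the standard genetic code. No start codon is required, since an ORF as defined need not encode the entirety of a natural protein.

```python
# Illustrative sketch only; is_orf is a hypothetical helper name.
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons of the standard genetic code


def is_orf(seq: str) -> bool:
    """Return True if seq reads as whole codons with no in-frame stop codon."""
    seq = seq.upper().replace("U", "T")  # accept RNA or DNA notation
    if not seq or len(seq) % 3 != 0:
        return False  # length must be exactly divisible by 3
    if set(seq) - set("ACGT"):
        return False  # assumption: ambiguous bases are rejected
    codons = (seq[i:i + 3] for i in range(0, len(seq), 3))
    return all(codon not in STOP_CODONS for codon in codons)
```

For example, `is_orf("ATGGCC")` is true, while `is_orf("ATGGC")` (length not divisible by 3) and `is_orf("ATGTAAGCC")` (in-frame stop codon) are false.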



WO 2004/050900 



PCT/US2003/040131 




As used herein, the phrase "ORF-encoded peptide" refers to the predicted or actual 
translation of an ORF. 

As used herein, the phrase "degenerate variant" of a reference nucleic acid
sequence is meant to be inclusive of all nucleic acid sequences that can be directly
translated, using the standard genetic code, to provide an amino acid sequence identical to
that translated from the reference nucleic acid sequence.
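
As a minimal sketch of this definition, two coding sequences can be translated with the standard genetic code and their translations compared. The function names `translate` and `is_degenerate_variant` are hypothetical and chosen for illustration; the sketch assumes both inputs are in frame and have lengths divisible by 3.

```python
# Illustrative sketch only; function names are hypothetical.
# Standard genetic code, built from the conventional T, C, A, G codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate an in-frame nucleic acid sequence ('*' marks a stop codon)."""
    seq = seq.upper().replace("U", "T")
    if len(seq) % 3 != 0:
        raise ValueError("sequence length must be divisible by 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


def is_degenerate_variant(candidate: str, reference: str) -> bool:
    """True if both sequences translate to the identical amino acid sequence."""
    return translate(candidate) == translate(reference)
```

Under these assumptions, `ATGGCCAAG` is a degenerate variant of `ATGGCTAAA` (both translate to `MAK`), whereas `ATGGCTAAC` is not (it translates to `MAN`).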

The term "polypeptide" encompasses both naturally occurring and non-naturally 
occurring proteins and polypeptides, as well as polypeptide fragments and polypeptide 
mutants, derivatives and analogs thereof. A polypeptide may be monomeric or polymeric.
Further, a polypeptide may comprise a number of different modules within a single
polypeptide each of which has one or more distinct activities. A preferred polypeptide in
accordance with the invention comprises a CSP encoded by a nucleic acid molecule of the 
instant invention, or a fragment, mutant, analog or derivative thereof. 

The term "isolated protein" or "isolated polypeptide" is a protein or polypeptide
that by virtue of its origin or source of derivation (1) is not associated with naturally
associated components that accompany it in its native state, (2) is free of other proteins
from the same species, (3) is expressed by a cell from a different species, or (4) does not
occur in nature. Thus, a polypeptide that is chemically synthesized or synthesized in a
cellular system different from the cell from which it naturally originates will be "isolated"
from its naturally associated components. A polypeptide or protein may also be rendered
substantially free of naturally associated components by isolation, using protein
purification techniques well known in the art.

A protein or polypeptide is "substantially pure," "substantially homogeneous" or
"substantially purified" when at least about 60% to 75% of a sample exhibits a single
species of polypeptide. The polypeptide or protein may be monomeric or multimeric. A
substantially pure polypeptide or protein will typically comprise about 50%, 60%, 70%,
80% or 90% W/W of a protein sample, more usually about 95%, and preferably will be
over 99% pure. Protein purity or homogeneity may be determined by a number of means
well known in the art, such as polyacrylamide gel electrophoresis of a protein sample,
followed by visualizing a single polypeptide band upon staining the gel with a stain well
known in the art. For certain purposes, higher resolution may be provided by using HPLC
or other means well known in the art for purification.

The term "fragment" when used herein with respect to polypeptides of the present
invention refers to a polypeptide that has an amino-terminal and/or carboxy-terminal
deletion compared to a full-length CSP. In a preferred embodiment, the fragment is a
contiguous sequence in which the amino acid sequence of the fragment is identical to the
corresponding positions in the naturally occurring polypeptide. Fragments typically are at
least 5, 6, 7, 8, 9 or 10 amino acids long, preferably at least 12, 14, 16 or 18 amino acids
long, more preferably at least 20 amino acids long, more preferably at least 25, 30, 35, 40
or 45 amino acids, even more preferably at least 50 or 60 amino acids long, and even
more preferably at least 70 amino acids long.

A "derivative" when used herein with respect to polypeptides of the present
invention refers to a polypeptide which is substantially similar in primary structural
sequence to a CSP but which includes, e.g., in vivo or in vitro chemical and biochemical
modifications that are not found in the CSP. Such modifications include, for example,
acetylation, acylation, ADP-ribosylation, amidation, covalent attachment of flavin,
covalent attachment of a heme moiety, covalent attachment of a nucleotide or nucleotide
derivative, covalent attachment of a lipid or lipid derivative, covalent attachment of
phosphatidylinositol, cross-linking, cyclization, disulfide bond formation, demethylation,
formation of covalent cross-links, formation of cystine, formation of pyroglutamate,
formylation, gamma-carboxylation, glycosylation, GPI anchor formation, hydroxylation,
iodination, methylation, myristoylation, oxidation, proteolytic processing,
phosphorylation, prenylation, racemization, selenoylation, sulfation, transfer-RNA
mediated addition of amino acids to proteins such as arginylation, and ubiquitination.
Other modifications include, e.g., labeling with radionuclides, and various enzymatic
modifications, as will be readily appreciated by those skilled in the art. A variety of
methods for labeling polypeptides and of substituents or labels useful for such purposes
are well known in the art, and include radioactive isotopes such as 125I, 32P, 35S, 14C and
3H, ligands which bind to labeled antiligands (e.g., antibodies), fluorophores,
chemiluminescent agents, enzymes, and antiligands which can serve as specific binding
pair members for a labeled ligand. The choice of label depends on the sensitivity required,
ease of conjugation with the primer, stability requirements, and available instrumentation.
Methods for labeling polypeptides are well known in the art. See Ausubel (1992), supra;
Ausubel (1999), supra.

The term "fusion protein" refers to polypeptides of the present invention coupled
to a heterologous amino acid sequence. Fusion proteins are useful because they can be
constructed to contain two or more desired functional elements from two or more different
proteins. A fusion protein comprises at least 10 contiguous amino acids from a
polypeptide of interest, more preferably at least 20 or 30 amino acids, even more
preferably at least 40, 50 or 60 amino acids, yet more preferably at least 75, 100 or 125
amino acids. Fusion proteins can be produced recombinantly by constructing a nucleic
acid sequence that encodes the polypeptide or a fragment thereof in frame with a nucleic
acid sequence encoding a different protein or peptide and then expressing the fusion
protein. Alternatively, a fusion protein can be produced chemically by crosslinking the
polypeptide or a fragment thereof to another protein.

The term "analog" refers to both polypeptide analogs and non-peptide analogs.
The term "polypeptide analog" as used herein refers to a polypeptide that is comprised of a
segment of at least 25 amino acids that has substantial identity to a portion of an amino
acid sequence but which contains non-natural amino acids or non-natural inter-residue
bonds. In a preferred embodiment, the analog has the same or similar biological activity
as the native polypeptide. Typically, polypeptide analogs comprise a conservative amino
acid substitution (or insertion or deletion) with respect to the naturally occurring sequence.
Analogs typically are at least 20 amino acids long, preferably at least 50 amino acids long
or longer, and can often be as long as a full-length naturally occurring polypeptide.

The term "non-peptide analog" refers to a compound with properties that are
analogous to those of a reference polypeptide. A non-peptide compound may also be
termed a "peptide mimetic" or a "peptidomimetic." Such compounds are often developed
with the aid of computerized molecular modeling. Peptide mimetics that are structurally
similar to useful peptides may be used to produce an equivalent effect. Generally,
peptidomimetics are structurally similar to a paradigm polypeptide (i.e., a polypeptide that
has a desired biochemical property or pharmacological activity), but have one or more
peptide linkages optionally replaced by a linkage selected from the group consisting of:
-CH2NH-, -CH2S-, -CH2-CH2-, -CH=CH- (cis and trans), -COCH2-,
-CH(OH)CH2-, and -CH2SO-, by methods well known in the art. Systematic
substitution of one or more amino acids of a consensus sequence with a D-amino acid of
the same type (e.g., D-lysine in place of L-lysine) may also be used to generate more
stable peptides. In addition, constrained peptides comprising a consensus sequence or a
substantially identical consensus sequence variation may be generated by methods known
in the art (Rizo et al., Ann. Rev. Biochem. 61:387-418 (1992)). For example, one may add
internal cysteine residues capable of forming intramolecular disulfide bridges which
cyclize the peptide.

The term "mutant" or "mutein" when referring to a polypeptide of the present
invention relates to an amino acid sequence containing substitutions, insertions or
deletions of one or more amino acids compared to the amino acid sequence of a CSP. A
mutein may have one or more amino acid point substitutions, in which a single amino acid
at a position has been changed to another amino acid, one or more insertions and/or
deletions, in which one or more amino acids are inserted or deleted, respectively, in the
sequence of the naturally occurring protein, and/or truncations of the amino acid sequence
at either or both the amino or carboxy termini. Further, a mutein may have the same or
different biological activity as the naturally occurring protein. For instance, a mutein may
have an increased or decreased biological activity. A mutein has at least 50% sequence
similarity to the wild type protein, preferred is 60% sequence similarity, more preferred is
70% sequence similarity. Even more preferred are muteins having 80%, 85% or 90%
sequence similarity to a CSP. In an even more preferred embodiment, a mutein exhibits
95% sequence identity, even more preferably 97%, even more preferably 98% and even
more preferably 99%. Sequence similarity may be measured by any common sequence
analysis algorithm, such as GAP or BESTFIT or other variation of Smith-Waterman
alignment. See T. F. Smith and M. S. Waterman, J. Mol. Biol. 147:195-197 (1981) and
W.R. Pearson, Genomics 11:635-650 (1991).
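
The percentage thresholds above can be made concrete with a simplified calculation. The following Python sketch is not GAP, BESTFIT or any Smith-Waterman implementation; it assumes the two sequences have already been aligned by such a tool, uses `-` as the gap character, and counts strict identity only (no credit for conservative substitutions). The function name `percent_identity` and the choice of alignment columns as the denominator are assumptions made for illustration.

```python
# Illustrative sketch only; assumes the inputs are already aligned.
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of alignment columns in which both sequences carry the same residue."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns where both sequences have a gap.
    columns = [(x, y) for x, y in zip(aligned_a, aligned_b)
               if not (x == gap and y == gap)]
    if not columns:
        return 0.0
    matches = sum(1 for x, y in columns if x == y and x != gap)
    return 100.0 * matches / len(columns)
```

For instance, a ten-residue alignment differing at one position gives 90% identity by this measure. Other conventions for the denominator (for example, the length of the shorter sequence) give different values, which is one reason the scoring tool and its parameters should be stated alongside any threshold.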

Preferred amino acid substitutions are those which: (1) reduce susceptibility to
proteolysis, (2) reduce susceptibility to oxidation, (3) alter binding affinity for forming
protein complexes, (4) alter binding affinity or enzymatic activity, and (5) confer or
modify other physicochemical or functional properties of such analogs. For example,
single or multiple amino acid substitutions (preferably conservative amino acid
substitutions) may be made in the naturally occurring sequence (preferably in the portion
of the polypeptide outside the domain(s) forming intermolecular contacts). In a preferred
embodiment, the amino acid substitutions are moderately conservative substitutions or
conservative substitutions. In a more preferred embodiment, the amino acid substitutions
are conservative substitutions. A conservative amino acid substitution should not
substantially change the structural characteristics of the parent sequence (e.g., a
replacement amino acid should not tend to disrupt a helix that occurs in the parent
sequence, or disrupt other types of secondary structure that characterize the parent
sequence). Examples of art-recognized polypeptide secondary and tertiary structures are
described in Creighton (ed.), Proteins, Structures and Molecular Principles, W. H.
Freeman and Company (1984); Branden et al. (ed.), Introduction to Protein Structure,
Garland Publishing (1991); Thornton et al., Nature 354:105-106 (1991).

As used herein, the twenty conventional amino acids and their abbreviations follow
conventional usage. See Golub et al. (eds.), Immunology - A Synthesis, 2nd Ed., Sinauer
Associates (1991). Stereoisomers (e.g., D-amino acids) of the twenty conventional amino
acids, unnatural amino acids such as α,α-disubstituted amino acids, N-alkyl amino acids,
and other unconventional amino acids may also be suitable components for polypeptides
of the present invention. Examples of unconventional amino acids include:
4-hydroxyproline, γ-carboxyglutamate, ε-N,N,N-trimethyllysine, ε-N-acetyllysine,
O-phosphoserine, N-acetylserine, N-formylmethionine, 3-methylhistidine,
5-hydroxylysine, σ-N-methylarginine, and other similar amino acids and imino acids (e.g.,
4-hydroxyproline). In the polypeptide notation used herein, the left-hand direction is the
amino terminal direction and the right-hand direction is the carboxy-terminal direction, in
accordance with standard usage and convention.

By "homology" or "homologous" when referring to a polypeptide of the present
invention it is meant polypeptides from different organisms with a similar sequence to
the encoded amino acid sequence of a CSP and a similar biological activity or function.
Although two polypeptides are said to be "homologous," this does not imply that there is
necessarily an evolutionary relationship between the polypeptides. Instead, the term
"homologous" is defined to mean that the two polypeptides have similar amino acid
sequences and similar biological activities or functions. In a preferred embodiment, a
homologous polypeptide is one that exhibits 50% sequence similarity to CSP, preferred is
60% sequence similarity, more preferred is 70% sequence similarity. Even more preferred
are homologous polypeptides that exhibit 80%, 85% or 90% sequence similarity to a CSP.
In yet a more preferred embodiment, a homologous polypeptide exhibits 95%, 97%, 98%
or 99% sequence similarity.

When "sequence similarity" is used in reference to polypeptides, it is recognized
that residue positions that are not identical often differ by conservative amino acid
substitutions. In a preferred embodiment, a polypeptide that has "sequence similarity"
comprises conservative or moderately conservative amino acid substitutions. A
"conservative amino acid substitution" is one in which an amino acid residue is substituted
by another amino acid residue having a side chain (R group) with similar chemical
properties (e.g., charge or hydrophobicity). In general, a conservative amino acid
substitution will not substantially change the functional properties of a protein. In cases
where two or more amino acid sequences differ from each other by conservative
substitutions, the percent sequence identity or degree of similarity may be adjusted
upwards to correct for the conservative nature of the substitution. Means for making this
adjustment are well known to those of skill in the art. See, e.g., Pearson, Methods Mol.
Biol. 24:307-31 (1994).

For instance, the following six groups each contain amino acids that are 
conservative substitutions for one another: 
1) Serine (S), Threonine (T);
2) Aspartic Acid (D), Glutamic Acid (E);
3) Asparagine (N), Glutamine (Q);
4) Arginine (R), Lysine (K);
5) Isoleucine (I), Leucine (L), Methionine (M), Alanine (A), Valine (V), and
6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W).
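
The six groups listed above lend themselves to a direct lookup. The following Python sketch encodes them with the standard one-letter amino acid codes; the names `CONSERVATIVE_GROUPS` and `is_conservative_substitution` are hypothetical, and treating an unchanged residue as "not a substitution" is an assumption of the sketch.

```python
# Illustrative sketch only; encodes the six groups using one-letter codes.
CONSERVATIVE_GROUPS = [
    frozenset("ST"),     # 1) Serine, Threonine
    frozenset("DE"),     # 2) Aspartic Acid, Glutamic Acid
    frozenset("NQ"),     # 3) Asparagine, Glutamine
    frozenset("RK"),     # 4) Arginine, Lysine
    frozenset("ILMAV"),  # 5) Isoleucine, Leucine, Methionine, Alanine, Valine
    frozenset("FYW"),    # 6) Phenylalanine, Tyrosine, Tryptophan
]


def is_conservative_substitution(original: str, replacement: str) -> bool:
    """True if both residues differ and fall within the same group."""
    a, b = original.upper(), replacement.upper()
    if a == b:
        return False  # assumption: an unchanged residue is not a substitution
    return any(a in group and b in group for group in CONSERVATIVE_GROUPS)
```

Under this grouping, S to T and L to V are conservative, while D to K is not. Residues that appear in none of the six groups (for example, glycine, proline, cysteine and histidine) have no conservative partner under this scheme.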

Alternatively, a conservative replacement is any change having a positive value in
the PAM250 log-likelihood matrix disclosed in Gonnet et al., Science 256:1443-45
(1992). A "moderately conservative" replacement is any change having a nonnegative
value in the PAM250 log-likelihood matrix.

Sequence similarity for polypeptides, which is also referred to as sequence
identity, is typically measured using sequence analysis software. Protein analysis software
matches similar sequences using measures of similarity assigned to various substitutions,
deletions and other modifications, including conservative amino acid substitutions. For
instance, GCG contains programs such as "Gap" and "Bestfit" which can be used with
default parameters to determine sequence homology or sequence identity between closely
related polypeptides, such as homologous polypeptides from different species of
organisms or between a wild type protein and a mutein thereof. See, e.g., GCG Version
6.1. Other programs include FASTA, discussed supra.

A preferred algorithm when comparing a sequence of the invention to a database
containing a large number of sequences from different organisms is the computer program
BLAST, especially blastp or tblastn. See, e.g., Altschul et al., J. Mol. Biol. 215:403-410
(1990); Altschul et al., Nucleic Acids Res. 25:3389-402 (1997). Preferred parameters for
blastp are:

    Expectation value: 10 (default)
    Filter: seg (default)
    Cost to open a gap: 11 (default)
    Cost to extend a gap: 1 (default)
    Max. alignments: 100 (default)
    Word size: 11 (default)
    No. of descriptions: 100 (default)
    Penalty Matrix: BLOSUM62

The length of polypeptide sequences compared for homology will generally be at
least about 16 amino acid residues, usually at least about 20 residues, more usually at least
about 24 residues, typically at least about 28 residues, and preferably more than about 35
residues. When searching a database containing sequences from a large number of
different organisms, it is preferable to compare amino acid sequences.

Algorithms other than blastp for database searching using amino acid sequences
are known in the art. For instance, polypeptide sequences can be compared using FASTA,
a program in GCG Version 6.1. FASTA (e.g., FASTA2 and FASTA3) provides
alignments and percent sequence identity of the regions of the best overlap between the
query and search sequences (Pearson (1990), supra; Pearson (2000), supra). For example,
percent sequence identity between amino acid sequences can be determined using FASTA
with its default or recommended parameters (a word size of 2 and the PAM250 scoring
matrix), as provided in GCG Version 6.1.

An "antibody" refers to an intact immunoglobulin, or to an antigen-binding portion
thereof that competes with the intact antibody for specific binding to a molecular species,
e.g., a polypeptide of the instant invention. Antigen-binding portions may be produced by
recombinant DNA techniques or by enzymatic or chemical cleavage of intact antibodies.
Antigen-binding portions include, inter alia, Fab, Fab', F(ab')2, Fv, dAb, and
complementarity determining region (CDR) fragments, single-chain antibodies (scFv),
chimeric antibodies, diabodies and polypeptides that contain at least a portion of an
immunoglobulin that is sufficient to confer specific antigen binding to the polypeptide. A
Fab fragment is a monovalent fragment consisting of the VL, VH, CL and CH1 domains;
a F(ab')2 fragment is a bivalent fragment comprising two Fab fragments linked by a
disulfide bridge at the hinge region; a Fd fragment consists of the VH and CH1 domains; a
Fv fragment consists of the VL and VH domains of a single arm of an antibody; and a dAb
fragment consists of a VH domain. See, e.g., Ward et al., Nature 341:544-546 (1989).

By "bind specifically" and "specific binding" as used herein it is meant the ability 
of the antibody to bind to a first molecular species in preference to binding to other 
molecular species with which the antibody and first molecular species are admixed. An 
antibody is said to "recognize" a first molecular species when it can bind specifically to
that first molecular species. 

A single-chain antibody (scFv) is an antibody in which VL and VH regions are
paired to form a monovalent molecule via a synthetic linker that enables them to be made
as a single protein chain. See, e.g., Bird et al., Science 242:423-426 (1988); Huston et al.,
Proc. Natl. Acad. Sci. USA 85:5879-5883 (1988). Diabodies are bivalent, bispecific
antibodies in which VH and VL domains are expressed on a single polypeptide chain, but
using a linker that is too short to allow for pairing between the two domains on the same
chain, thereby forcing the domains to pair with complementary domains of another chain
and creating two antigen binding sites. See, e.g., Holliger et al., Proc. Natl. Acad. Sci. USA
90:6444-6448 (1993); Poljak et al., Structure 2:1121-1123 (1994). One or more CDRs
may be incorporated into a molecule either covalently or noncovalently to make it an
immunoadhesin. An immunoadhesin may incorporate the CDR(s) as part of a larger
polypeptide chain, may covalently link the CDR(s) to another polypeptide chain, or may
incorporate the CDR(s) noncovalently. The CDRs permit the immunoadhesin to
specifically bind to a particular antigen of interest. A chimeric antibody is an antibody
that contains one or more regions from one antibody and one or more regions from one or
more other antibodies.

An antibody may have one or more binding sites. If there is more than one binding
site, the binding sites may be identical to one another or may be different. For instance, a
naturally occurring immunoglobulin has two identical binding sites, a single-chain
antibody or Fab fragment has one binding site, while a "bispecific" or "bifunctional"
antibody has two different binding sites.

An "isolated antibody" is an antibody that (1) is not associated with naturally-
associated components, including other naturally-associated antibodies, that accompany it
in its native state, (2) is free of other proteins from the same species, (3) is expressed by a
cell from a different species, or (4) does not occur in nature. It is known that purified
proteins, including purified antibodies, may be stabilized with non-naturally-associated
components. The non-naturally-associated component may be a protein, such as albumin
(e.g., BSA) or a chemical such as polyethylene glycol (PEG).

A "neutralizing antibody" or "an inhibitory antibody" is an antibody that inhibits
the activity of a polypeptide or blocks the binding of a polypeptide to a ligand that
normally binds to it. An "activating antibody" is an antibody that increases the activity of
a polypeptide.

The term "epitope" includes any protein determinant capable of specific binding to 
an immunoglobulin or T-cell receptor. Epitopic determinants usually consist of 
chemically active surface groupings of molecules such as amino acids or sugar side chains 
and usually have specific three-dimensional structural characteristics, as well as specific
charge characteristics. An antibody is said to specifically bind an antigen when the
dissociation constant is less than 1 µM, preferably less than 100 nM and most preferably
less than 10 nM.
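
As an illustration only, the dissociation-constant thresholds above can be written as a small classifier. This is a minimal sketch that assumes the thresholds read as 1 µM, 100 nM and 10 nM; the function name `binding_tier` and its tier labels are hypothetical and are not terms used in this specification.

```python
def binding_tier(kd_molar: float) -> str:
    """Classify a dissociation constant (in mol/L) against the stated thresholds.

    The tier labels are illustrative names, not terms from the specification.
    """
    if kd_molar <= 0:
        raise ValueError("dissociation constant must be positive")
    if kd_molar < 10e-9:       # less than 10 nM
        return "most preferred"
    if kd_molar < 100e-9:      # less than 100 nM
        return "preferred"
    if kd_molar < 1e-6:        # less than 1 uM
        return "specific"
    return "not specific"


if __name__ == "__main__":
    for kd in (5e-9, 5e-8, 5e-7, 1e-5):
        print(f"Kd = {kd:.0e} M -> {binding_tier(kd)}")
```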

The term "patient" includes human and veterinary subjects.

Throughout this specification and claims, the word "comprise," or variations such
as "comprises" or "comprising," will be understood to imply the inclusion of a stated
integer or group of integers but not the exclusion of any other integer or group of integers. 

The term "colon specific" refers to a nucleic acid molecule or polypeptide that is 
expressed predominantly in the colon as compared to other tissues in the body. In a 
preferred embodiment, a "colon specific" nucleic acid molecule or polypeptide is detected
at a level that is 1.5-fold higher than in any other tissue in the body. In a more preferred
embodiment, the "colon specific" nucleic acid molecule or polypeptide is detected at a
level that is 2-fold higher than in any other tissue in the body, more preferably 5-fold higher,
still more preferably at least 10-fold, 15-fold, 20-fold, 25-fold, 50-fold or 100-fold higher
than in any other tissue in the body. Nucleic acid molecule levels may be measured by
nucleic acid hybridization, such as Northern blot hybridization, or quantitative PCR. 
Polypeptide levels may be measured by any method known to accurately quantitate 
protein levels, such as Western blot analysis. 
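
The fold-difference criterion above can be sketched in a few lines. This is a hedged illustration, assuming expression levels have already been measured and normalized to a common scale by one of the methods named above; the function names, the tissue labels and the example values are hypothetical.

```python
def colon_fold_specificity(levels: dict, tissue: str = "colon") -> float:
    """Ratio of the colon level to the highest level seen in any other tissue."""
    others = [value for name, value in levels.items() if name != tissue]
    if not others:
        raise ValueError("need at least one comparison tissue")
    highest_other = max(others)
    if highest_other == 0:
        # No detectable signal elsewhere: treat any colon signal as unbounded fold.
        return float("inf") if levels[tissue] > 0 else 0.0
    return levels[tissue] / highest_other


def is_colon_specific(levels: dict, min_fold: float = 1.5) -> bool:
    """True when the colon level is at least `min_fold` times every other tissue."""
    return colon_fold_specificity(levels) >= min_fold


if __name__ == "__main__":
    # Hypothetical normalized expression values (arbitrary units).
    example = {"colon": 120.0, "liver": 10.0, "lung": 24.0, "kidney": 6.0}
    print(colon_fold_specificity(example))           # colon vs. highest other tissue
    print(is_colon_specific(example, min_fold=5.0))
```

Comparing against the single highest non-colon tissue, rather than an average, matches the "any other tissue" wording of the definition.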
Nucleic Acid Molecules, Regulatory Sequences, Vectors, Host Cells and Recombinant
Methods of Making Polypeptides

Nucleic Acid Molecules

One aspect of the invention provides isolated nucleic acid molecules that are
specific to the colon or to colon cells or tissue or that are derived from such nucleic acid
molecules. These isolated colon specific nucleic acids (CSNAs) may comprise cDNA,
genomic DNA, RNA, or a combination thereof, a fragment of one of these nucleic acids,
or may be a non-naturally occurring nucleic acid molecule. A CSNA may be derived from
an animal. In a preferred embodiment, the CSNA is derived from a human or other
mammal. In a more preferred embodiment, the CSNA is derived from a human or other
primate. In an even more preferred embodiment, the CSNA is derived from a human.

In a preferred embodiment, the nucleic acid molecule encodes a polypeptide that 
is specific to colon, a colon-specific polypeptide (CSP). In a more preferred embodiment, 
the nucleic acid molecule encodes a polypeptide that comprises an amino acid sequence of
SEQ ID NO: 113-259. In another highly preferred embodiment, the nucleic acid molecule
comprises a nucleic acid sequence of SEQ ID NO: 1-112. Nucleotide sequences of the 
instantly-described nucleic acid molecules were determined by assembling several DNA 
molecules from either public or proprietary databases. Some of the underlying DNA 
sequences are the result, directly or indirectly, of at least one enzymatic polymerization
reaction (e.g., reverse transcription and/or polymerase chain reaction) using an automated
sequencer (such as the MegaBACE™ 1000, Amersham Biosciences, Sunnyvale, CA, 
USA). 

Nucleic acid molecules of the present invention may also comprise sequences that 
selectively hybridize to a nucleic acid molecule encoding a CSNA or a complement or
antisense thereof. The hybridizing nucleic acid molecule may or may not encode a
polypeptide or may or may not encode a CSP. However, in a preferred embodiment, the
hybridizing nucleic acid molecule encodes a CSP. In a more preferred embodiment, the 
invention provides a nucleic acid molecule that selectively hybridizes to a nucleic acid 
molecule or the antisense sequence of a nucleic acid molecule that encodes a polypeptide
comprising an amino acid sequence of SEQ ID NO: 113-259. In an even more preferred
embodiment, the invention provides a nucleic acid molecule that selectively hybridizes to
a nucleic acid molecule comprising the nucleic acid sequence of SEQ ID NO: 1-112 or the
antisense sequence thereof. Preferably, the nucleic acid molecule selectively hybridizes to
a nucleic acid molecule or the antisense sequence of a nucleic acid molecule encoding a
CSP under low stringency conditions. More preferably, the nucleic acid molecule 
selectively hybridizes to a nucleic acid molecule or the antisense sequence of a nucleic 
acid molecule encoding a CSP under moderate stringency conditions. Most preferably,
the nucleic acid molecule selectively hybridizes to a nucleic acid molecule or the antisense
sequence of a nucleic acid molecule encoding a CSP under high stringency conditions. In 
a preferred embodiment, the nucleic acid molecule hybridizes under low, moderate or high 
stringency conditions to a nucleic acid molecule or the antisense sequence of a nucleic 
acid molecule encoding a polypeptide comprising an amino acid sequence of SEQ ID NO:
113-259. In a more preferred embodiment, the nucleic acid molecule hybridizes under
low, moderate or high stringency conditions to a nucleic acid molecule or the antisense 
sequence of a nucleic acid molecule comprising a nucleic acid sequence selected from 
SEQ ID NO: 1-112. 

Nucleic acid molecules of the present invention may also comprise nucleic acid
sequences that exhibit substantial sequence similarity to a nucleic acid encoding a CSP or
a complement of the encoding nucleic acid molecule. In this embodiment, it is preferred 
that the nucleic acid molecule exhibit substantial sequence similarity to a nucleic acid 
molecule encoding human CSP. More preferred is a nucleic acid molecule exhibiting 
substantial sequence similarity to a nucleic acid molecule encoding a polypeptide having
an amino acid sequence of SEQ ID NO: 113-259. By substantial sequence similarity it is
meant a nucleic acid molecule having at least 60%, more preferably at least 70%, even
more preferably at least 80% and even more preferably at least 85% sequence identity
with a nucleic acid molecule encoding a CSP, such as a polypeptide having an amino acid
sequence of SEQ ID NO: 113-259. In a more preferred embodiment, the similar nucleic
acid molecule is one that has at least 90%, more preferably at least 95%, more preferably
at least 97%, even more preferably at least 98%, and still more preferably at least 99% 
sequence identity with a nucleic acid molecule encoding a CSP. Most preferred in this 
embodiment is a nucleic acid molecule that has at least 99.5%, 99.6%, 99.7%, 99.8% or 
99.9% sequence identity with a nucleic acid molecule encoding a CSP. 

The nucleic acid molecules of the present invention are also inclusive of those
exhibiting substantial sequence similarity to a CSNA or its complement. In this
embodiment, it is preferred that the nucleic acid molecule exhibit substantial sequence
similarity to a nucleic acid molecule having a nucleic acid sequence of SEQ ID NO: 1-112.
By substantial sequence similarity it is meant a nucleic acid molecule that has at
least 60%, more preferably at least 70%, even more preferably at least 80% and even more 
preferably at least 85% sequence identity with a CSNA, such as one having a nucleic acid 
sequence of SEQ ID NO: 1-112. More preferred is a nucleic acid molecule that has at
least 90%, more preferably at least 95%, more preferably at least 97%, even more
preferably at least 98%, and still more preferably at least 99% sequence identity with a
CSNA. Most preferred is a nucleic acid molecule that has at least 99.5%, 99.6%, 99.7%, 
99.8% or 99.9% sequence identity with a CSNA. 

Nucleic acid molecules that exhibit substantial sequence similarity are inclusive of
sequences that exhibit sequence identity over their entire length to a CSNA or to a nucleic
acid molecule encoding a CSP, as well as sequences that are similar over only a part of their
length. In this case, the part is at least 50 nucleotides of the CSNA or the nucleic acid
molecule encoding a CSP, preferably at least 100 nucleotides, more preferably at least 150
or 200 nucleotides, even more preferably at least 250 or 300 nucleotides, still more
preferably at least 400 or 500 nucleotides.
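
The identity thresholds and minimum part length described above can be illustrated with a short calculation. This is a minimal sketch that assumes the two sequences have already been aligned without gaps and are of equal length; in practice an alignment program would supply the aligned region. The function names and the test sequences are hypothetical.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent of positions that match between two pre-aligned, equal-length sequences."""
    if len(seq_a) != len(seq_b) or not seq_a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)


def meets_similarity(seq_a: str, seq_b: str,
                     min_identity: float = 60.0, min_length: int = 50) -> bool:
    """Apply an identity threshold over a compared part of at least `min_length` nucleotides."""
    if len(seq_a) < min_length or len(seq_a) != len(seq_b):
        return False
    return percent_identity(seq_a, seq_b) >= min_identity


if __name__ == "__main__":
    reference = "ACGT" * 15                  # hypothetical 60-nucleotide region
    variant = list(reference)
    for i in range(0, 60, 10):               # introduce 6 substitutions
        variant[i] = "C" if reference[i] == "A" else "A"
    variant = "".join(variant)
    print(percent_identity(reference, variant))
    print(meets_similarity(reference, variant, min_identity=85.0))
```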

The substantially similar nucleic acid molecule may be a naturally occurring one 
that is derived from another species, especially one derived from another primate, wherein 
the similar nucleic acid molecule encodes an amino acid sequence that exhibits significant 
sequence identity to that of SEQ ID NO: 113-259 or demonstrates significant sequence
identity to the nucleotide sequence of SEQ ID NO: 1-112. The similar nucleic acid
molecule may also be a naturally occurring nucleic acid molecule from a human, when the
CSNA is a member of a gene family. The similar nucleic acid molecule may also be a 
naturally occurring nucleic acid molecule derived from a non-primate, mammalian 
species, including without limitation, domesticated species, e.g., dog, cat, mouse, rat,
rabbit, hamster, cow, horse and pig; and wild animals, e.g., monkey, fox, lions, tigers,
bears, giraffes, zebras, etc. The substantially similar nucleic acid molecule may also be a
naturally occurring nucleic acid molecule derived from a non-mammalian species, such as 
birds or reptiles. The naturally occurring substantially similar nucleic acid molecule may 
be isolated directly from humans or other species. In another embodiment, the
substantially similar nucleic acid molecule may be one that is experimentally produced by
random mutation of a nucleic acid molecule. In another embodiment, the substantially
similar nucleic acid molecule may be one that is experimentally produced by directed
mutation of a CSNA. In a preferred embodiment, the substantially similar nucleic acid
molecule is a CSNA. 

The nucleic acid molecules of the present invention are also inclusive of allelic 
variants of a CSNA or a nucleic acid encoding a CSP. For example, single nucleotide
polymorphisms (SNPs) occur frequently in eukaryotic genomes and the sequence
determined from one individual of a species may differ from other allelic forms present
within the population. More than 1.4 million SNPs have already been identified in the
human genome. International Human Genome Sequencing Consortium, Nature 409: 860-921
(2001). Variants with small deletions and insertions of more than a single nucleotide
are also found in the general population, and often do not alter the function of the protein.
In addition, amino acid substitutions occur frequently among natural allelic variants, and 
often do not substantially change protein function. 

In a preferred embodiment, the allelic variant is a variant of a gene, wherein the 
gene is transcribed into a mRNA that encodes a CSP. In a more preferred embodiment,
the gene is transcribed into a mRNA that encodes a CSP comprising an amino acid
sequence of SEQ ID NO: 113-259. In another preferred embodiment, the allelic variant is
a variant of a gene, wherein the gene is transcribed into a mRNA that is a CSNA. In a 
more preferred embodiment, the gene is transcribed into a mRNA that comprises the 
nucleic acid sequence of SEQ ID NO: 1-112. Also preferred is that the allelic variant be a
naturally occurring allelic variant in the species of interest, particularly human.

Nucleic acid molecules of the present invention are also inclusive of nucleic acid 
sequences comprising a part of a nucleic acid sequence of the instant invention. The part 
may or may not encode a polypeptide, and may or may not encode a polypeptide that is a 
CSP. In a preferred embodiment, the part encodes a CSP. In one embodiment, the
nucleic acid molecule comprises a part of a CSNA. In another embodiment, the nucleic
acid molecule comprises a part of a nucleic acid molecule that hybridizes or exhibits 
substantial sequence similarity to a CSNA. In another embodiment, the nucleic acid 
molecule comprises a part of a nucleic acid molecule that is an allelic variant of a CSNA. 
In yet another embodiment, the nucleic acid molecule comprises a part of a nucleic acid
molecule that encodes a CSP. A part comprises at least 10 nucleotides, more preferably at
least 15, 17, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400 or 
500 nucleotides. The maximum size of a nucleic acid part is one nucleotide shorter than 
the sequence of the nucleic acid molecule encoding the full-length protein.

Nucleic acid molecules of the present invention are also inclusive of nucleic acid
sequences that encode fusion proteins, homologous proteins, polypeptide fragments, 
muteins and polypeptide analogs, as described infra. 

Nucleic acid molecules of the present invention are also inclusive of nucleic acid
sequences containing modifications of the native nucleic acid molecule. Examples of such
modifications include, but are not limited to, non-native internucleoside bonds,
post-synthetic modifications or altered nucleotide analogues. One having ordinary skill in the
art would recognize that the type of modification that may be made will depend upon the 
intended use of the nucleic acid molecule. For instance, when the nucleic acid molecule is
used as a hybridization probe, the range of such modifications will be limited to those that
permit sequence-discriminating base pairing of the resulting nucleic acid. When used to 
direct expression of RNA or protein in vitro or in vivo, the range of such modifications 
will be limited to those that permit the nucleic acid to function properly as a 
polymerization substrate. When the isolated nucleic acid is used as a therapeutic agent,
the modifications will be limited to those that do not confer toxicity upon the isolated
nucleic acid. 

Accordingly, in one embodiment, a nucleic acid molecule may include nucleotide 
analogues that incorporate labels that are directly detectable, such as radiolabels or 
fluorophores, or nucleotide analogues that incorporate labels that can be visualized in a
subsequent reaction, such as biotin or various haptens. The labeled nucleic acid molecules
are particularly useful as hybridization probes. 

Common radiolabeled analogues include those labeled with ³³P, ³²P, and ³⁵S, such
as α-³²P-dATP, α-³²P-dCTP, α-³²P-dGTP, α-³²P-dTTP, α-³²P-3'dATP, α-³²P-ATP,
α-³²P-CTP, α-³²P-GTP, α-³²P-UTP, α-³⁵S-dATP, γ-³⁵S-GTP, γ-³³P-dATP, and the like.

Commercially available fluorescent nucleotide analogues readily incorporated into
the nucleic acids of the present invention include Cy3-dCTP, Cy3-dUTP, Cy5-dCTP, Cy3-dUTP
(Amersham Biosciences, Piscataway, New Jersey, USA), fluorescein-12-dUTP,
tetramethylrhodamine-6-dUTP, Texas Red®-5-dUTP, Cascade Blue®-7-dUTP,
BODIPY® FL-14-dUTP, BODIPY® TMR-14-dUTP, BODIPY® TR-14-dUTP,
Rhodamine Green™-5-dUTP, Oregon Green® 488-5-dUTP, Texas Red®-12-dUTP,
BODIPY® 630/650-14-dUTP, BODIPY® 650/665-14-dUTP, Alexa Fluor® 488-5-dUTP,
Alexa Fluor® 532-5-dUTP, Alexa Fluor® 568-5-dUTP, Alexa Fluor® 594-5-dUTP,
Alexa Fluor® 546-14-dUTP, fluorescein-12-UTP, tetramethylrhodamine-6-UTP, Texas
Red®-5-UTP, Cascade Blue®-7-UTP, BODIPY® FL-14-UTP, BODIPY® TMR-14-UTP,
BODIPY® TR-14-UTP, Rhodamine Green™-5-UTP, Alexa Fluor® 488-5-UTP, Alexa
Fluor® 546-14-UTP (Molecular Probes, Inc. Eugene, OR, USA). One may also custom
synthesize nucleotides having other fluorophores. See Henegariu et al., Nature
Biotechnol. 18: 345-348 (2000).

Haptens that are commonly conjugated to nucleotides for subsequent labeling
include biotin (biotin-11-dUTP, Molecular Probes, Inc., Eugene, OR, USA;
biotin-21-UTP, biotin-21-dUTP, Clontech Laboratories, Inc., Palo Alto, CA, USA),
digoxigenin (DIG-11-dUTP, alkali labile, DIG-11-UTP, Roche Diagnostics Corp.,
Indianapolis, IN, USA), and dinitrophenyl (dinitrophenyl-11-dUTP, Molecular Probes,
Inc., Eugene, OR, USA).

Nucleic acid molecules of the present invention can be labeled by incorporation of 
labeled nucleotide analogues into the nucleic acid. Such analogues can be incorporated by 
enzymatic polymerization, such as by nick translation, random priming, polymerase chain
reaction (PCR), terminal transferase tailing, and end-filling of overhangs, for DNA
molecules, and in vitro transcription driven, e.g., from phage promoters, such as T7, T3,
and SP6, for RNA molecules. Commercial kits are readily available for each such
labeling approach. Analogues can also be incorporated during automated solid phase
chemical synthesis. Labels can also be incorporated after nucleic acid synthesis, with the
5' phosphate and 3' hydroxyl providing convenient sites for post-synthetic covalent
attachment of detectable labels. 

Other post-synthetic approaches also permit internal labeling of nucleic acids. For 
example, fluorophores can be attached using a cisplatin reagent that reacts with the N7 of 
guanine residues (and, to a lesser extent, adenine bases) in DNA, RNA, and Peptide
Nucleic Acids (PNA) to provide a stable coordination complex between the nucleic acid
and fluorophore label (Universal Linkage System) (available from Molecular Probes, Inc.,
Eugene, OR, USA and Amersham Pharmacia Biotech, Piscataway, NJ, USA); see Alers et
al., Genes, Chromosomes & Cancer 25: 301-305 (1999); Jelsma et al., J. NIH Res. 5: 82
(1994); Van Belkum et al., BioTechniques 16: 148-153 (1994). Alternatively, nucleic
acids can be labeled using a disulfide-containing linker (FastTag™ Reagent, Vector
Laboratories, Inc., Burlingame, CA, USA) that is photo- or thermally coupled to the target
nucleic acid using aryl azide chemistry; after reduction, a free thiol is available for 
coupling to a hapten, fluorophore, sugar, affinity ligand, or other marker.

One or more independent or interacting labels can be incorporated into the nucleic
acid molecules of the present invention. For example, both a fluorophore and a moiety
that in proximity thereto acts to quench fluorescence can be included to report specific
hybridization through release of fluorescence quenching or to report exonucleotidic
excision. See, e.g., Tyagi et al., Nature Biotechnol. 14: 303-308 (1996); Tyagi et al.,
Nature Biotechnol. 16: 49-53 (1998); Sokol et al., Proc. Natl. Acad. Sci. USA 95:
11538-11543 (1998); Kostrikis et al., Science 279: 1228-1229 (1998); Marras et al.,
Genet. Anal. 14: 151-156 (1999); Holland et al., Proc. Natl. Acad. Sci. USA 88:
7276-7280 (1991); Heid et al., Genome Res. 6(10): 986-94 (1996); Kuimelis et al.,
Nucleic Acids Symp. Ser. (37): 255-6 (1997); and U.S. Patent Nos. 5,846,726, 5,925,517,
5,723,591 and 5,538,848, the disclosures of which are incorporated herein by
reference in their entireties.

Nucleic acid molecules of the present invention may also be modified by altering
one or more native phosphodiester internucleoside bonds to more nuclease-resistant
internucleoside bonds. See Hartmann et al. (eds.), Manual of Antisense Methodology:
Perspectives in Antisense Science. Kluwer Law International (1999); Stein et al. (eds.),
Applied Antisense Oligonucleotide Technology. Wiley-Liss (1998); Chadwick et al.
(eds.), Oligonucleotides as Therapeutic Agents - Symposium No. 209. John Wiley & Son
Ltd (1997). Such altered internucleoside bonds are often desired for antisense techniques or for
targeted gene correction, Gamper et al., Nucl. Acids Res. 28(21): 4332-4339 (2000). For
double-stranded RNA inhibition, which may utilize either natural dsRNA or dsRNA
modified in its sugar, phosphate or base, see Hannon, Nature 418(11): 244-251 (2002);
Fire et al. in WO 99/32619; Tuschl et al. in US2002/0086356; Kreutzer et al. in WO
00/44895, the disclosures of which are incorporated herein by reference in their entirety.

For circular antisense, see Kool in U.S. Patent No. 5,426,180, the disclosure of which is
incorporated herein by reference in its entirety. 

Modified oligonucleotide backbones include, without limitation, 
phosphorothioates, chiral phosphorothioates, phosphorodithioates, phosphotriesters, 
aminoalkylphosphotriesters, methyl and other alkyl phosphonates including 3'-alkylene
phosphonates and chiral phosphonates, phosphinates, phosphoramidates including
3'-amino phosphoramidate and aminoalkylphosphoramidates, thionophosphoramidates,
thionoalkylphosphonates, thionoalkylphosphotriesters, and boranophosphates having
normal 3'-5' linkages, 2'-5' linked analogs of these, and those having inverted polarity
wherein the adjacent pairs of nucleoside units are linked 3'-5' to 5'-3' or 2'-5' to 5'-2'.
Representative U.S. Patents that teach the preparation of the above phosphorus-containing 
linkages include, but are not limited to, U.S. Patent Nos. 3,687,808; 4,469,863; 4,476,301; 
5,023,243; 5,177,196; 5,188,897; 5,264,423; 5,276,019; 5,278,302; 5,286,717; 5,321,131; 
5,399,676; 5,405,939; 5,453,496; 5,455,233; 5,466,677; 5,476,925; 5,519,126; 5,536,821;
5,541,306; 5,550,111; 5,563,253; 5,571,799; 5,587,361; and 5,625,050, the disclosures of 
which are incorporated herein by reference in their entireties. In a preferred embodiment, 
the modified internucleoside linkages may be used for antisense techniques. 

Other modified oligonucleotide backbones do not include a phosphorus atom, but 
have backbones that are formed by short chain alkyl or cycloalkyl internucleoside
linkages, mixed heteroatom and alkyl or cycloalkyl internucleoside linkages, or one or
more short chain heteroatomic or heterocyclic internucleoside linkages. These include 
those having morpholino linkages (formed in part from the sugar portion of a nucleoside); 
siloxane backbones; sulfide, sulfoxide and sulfone backbones; formacetyl and 
thioformacetyl backbones; methylene formacetyl and thioformacetyl backbones; alkene
containing backbones; sulfamate backbones; methyleneimino and methylenehydrazino 
backbones; sulfonate and sulfonamide backbones; amide backbones; and others having 
mixed N, O, S and CH2 component parts. Representative U.S. patents that teach the 
preparation of the above backbones include, but are not limited to, U.S. Patent Nos. 
5,034,506; 5,166,315; 5,185,444; 5,214,134; 5,216,141; 5,235,033; 5,264,562; 5,264,564;
5,405,938; 5,434,257; 5,466,677; 5,470,967; 5,489,677; 5,541,307; 5,561,225; 5,596,086;
5,602,240; 5,610,289; 5,608,046; 5,618,704; 5,623,070; 5,663,312;
5,633,360; 5,677,437 and 5,677,439; the disclosures of which are incorporated herein by
reference in their entireties.

In other preferred nucleic acid molecules, both the sugar and the internucleoside
linkage are replaced with novel groups, such as peptide nucleic acids (PNA). In PNA
compounds, the phosphodiester backbone of the nucleic acid is replaced with an amide- 
containing backbone, in particular by repeating N-(2-aminoethyl) glycine units linked by 
amide bonds. Nucleobases are bound directly or indirectly to aza nitrogen atoms of the 
amide portion of the backbone, typically by methylene carbonyl linkages. PNA can be
synthesized using a modified peptide synthesis protocol. PNA oligomers can be 
synthesized by both Fmoc and tBoc methods. Representative U.S. patents that teach the 
preparation of PNA compounds include, but are not limited to, U.S. Patent Nos.
5,539,082; 5,714,331; and 5,719,262, each of which is herein incorporated by reference in
its entirety. Automated PNA synthesis is readily achievable on commercial synthesizers
(see, e.g., "PNA User's Guide," Rev. 2, February 1998, Perseptive Biosystems Part No.
60138, Applied Biosystems, Inc., Foster City, CA). PNA molecules are advantageous for
a number of reasons. First, because the PNA backbone is uncharged, PNA/DNA and
PNA/RNA duplexes have a higher thermal stability than is found in DNA/DNA and 
DNA/RNA duplexes. The Tm of a PNA/DNA or PNA/RNA duplex is generally 1°C 
higher per base pair than the Tm of the corresponding DNA/DNA or DNA/RNA duplex 
(in 100 mM NaCl). Second, PNA molecules can also form stable PNA/DNA complexes 
at low ionic strength, under conditions in which DNA/DNA duplex formation does not
occur. Third, PNA also demonstrates greater specificity in binding to complementary 
DNA because a PNA/DNA mismatch is more destabilizing than a DNA/DNA mismatch. A
single mismatch in a mixed PNA/DNA 15-mer lowers the Tm by 8-20°C (15°C on
average). In the corresponding DNA/DNA duplexes, a single mismatch lowers the Tm by
4-16°C (11°C on average). Because PNA probes can be significantly shorter than DNA
probes, their specificity is greater. Fourth, PNA oligomers are resistant to degradation by 
enzymes, and the lifetime of these compounds is extended both in vivo and in vitro 
because nucleases and proteases do not recognize the PNA polyamide backbone with 
nucleobase sidechains. See, e.g., Ray et al., FASEB J. 14(9): 1041-60 (2000); Nielsen et
al., Pharmacol. Toxicol. 86(1): 3-7 (2000); Larsen et al., Biochim. Biophys. Acta 1489(1):
159-66 (1999); Nielsen, Curr. Opin. Struct. Biol. 9(3): 353-7 (1999), and Nielsen, Curr.
Opin. Biotechnol. 10(1): 71-5 (1999).
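
The thermal-stability figures quoted above lend themselves to a quick arithmetic illustration. The sketch below applies only the rules of thumb stated in the text (about 1°C gained per base pair for a PNA duplex, and average single-mismatch drops of 15°C and 11°C). The starting DNA/DNA Tm of 50°C is a hypothetical value chosen for the example, and the function names are illustrative.

```python
def pna_duplex_tm(dna_duplex_tm_c: float, length_bp: int,
                  gain_per_bp_c: float = 1.0) -> float:
    """Estimate a PNA/DNA Tm from the corresponding DNA/DNA Tm (rule of thumb)."""
    return dna_duplex_tm_c + gain_per_bp_c * length_bp


def tm_after_single_mismatch(tm_c: float, average_drop_c: float) -> float:
    """Apply an average single-mismatch Tm penalty."""
    return tm_c - average_drop_c


if __name__ == "__main__":
    dna_tm = 50.0                     # hypothetical Tm of a DNA/DNA 15-mer, in deg C
    pna_tm = pna_duplex_tm(dna_tm, length_bp=15)
    print("PNA/DNA perfect match:", pna_tm)
    print("PNA/DNA one mismatch: ", tm_after_single_mismatch(pna_tm, 15.0))
    print("DNA/DNA one mismatch: ", tm_after_single_mismatch(dna_tm, 11.0))
```

Under these assumptions the gap between matched and mismatched duplexes is 15°C for the PNA probe against 11°C for the DNA probe, which is the basis of the specificity advantage described above.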

Nucleic acid molecules may be modified compared to their native structure
throughout the length of the nucleic acid molecule, or the modifications can be localized to
discrete portions thereof. As an example of the latter, chimeric nucleic acids can be synthesized that have
discrete DNA and RNA domains and that can be used for targeted gene repair and
modified PCR reactions, as further described in Misra et al., Biochem. 37: 1917-1925
(1998); and Finn et al., Nucl. Acids Res. 24: 3357-3363 (1996), and U.S. Patent Nos.
5,760,012 and 5,731,181, the disclosures of which are incorporated herein by reference in
their entireties.

Unless otherwise specified, nucleic acid molecules of the present invention can 
include any topological conformation appropriate to the desired use; the term thus 
explicitly comprehends, among others, single-stranded, double-stranded, triplexed,
quadruplexed, partially double-stranded, partially-triplexed, partially-quadruplexed,
branched, hairpinned, circular, and padlocked conformations. Padlocked conformations
and their utilities are further described in Banér et al., Curr. Opin. Biotechnol. 12: 11-15
(2001); Escude et al., Proc. Natl. Acad. Sci. USA 14: 96(19): 10603-7 (1999); and Nilsson
et al., Science 265(5181): 2085-8 (1994). Triplexed and quadruplexed conformations, and
their utilities, are reviewed in Praseuth et al., Biochim. Biophys. Acta 1489(1): 181-206
(1999); Fox, Curr. Med. Chem. 7(1): 17-37 (2000); Kochetkova et al., Methods Mol. Biol.
130: 189-201 (2000); Chan et al., J. Mol. Med. 75(4): 267-82 (1997); Rowley et al., Mol.
Med. 5: 693-700 (1999); Kool, Annu. Rev. Biophys. Biomol. Struct. 25: 1-28 (1996).

SNP Polymorphisms

Commonly, sequence differences between individuals involve differences in single 
nucleotide positions. SNPs may account for 90% of human DNA polymorphism. Collins
et al., 8 Genome Res. 1229-31 (1998). SNPs include single base pair positions in genomic
DNA at which different sequence alternatives (alleles) exist in a population. In addition,
the least frequent allele generally must occur at a frequency of 1% or greater. DNA
sequence variants with a reasonably high population frequency are observed
approximately every 1,000 nucleotides across the genome, with estimates as high as 1 SNP
per 350 base pairs. Wang et al., 280 Science 1077-82 (1998); Harding et al., 60 Am. J.
Human Genet. 772-89 (1997); Taillon-Miller et al., 8 Genome Res. 748-54 (1998); Cargill
et al., 22 Nat. Genet. 231-38 (1999); and Semple et al., 16 Bioinform. Disc. Note 735-38
(2000). The frequency of SNPs varies with the type and location of the change. In base
substitutions, two-thirds of the substitutions involve the C-T and G-A type. This variation
in frequency can be related to 5-methylcytosine deamination reactions that occur
frequently, particularly at CpG dinucleotides. Regarding location, SNPs occur at a much
higher frequency in non-coding regions than in coding regions. Information on over one
million variable sequences is already publicly available via the Internet and more such 
markers are available from commercial providers of genetic information. Kwok and Gu, 5 
Med. Today 538-53 (1999). 

Several definitions of SNPs exist. See, e.g., Brooks, 235 Gene 177-86 (1999). As
used herein, the term "single nucleotide polymorphism" or "SNP" includes all single base
variants, thus including nucleotide insertions and deletions in addition to single nucleotide
substitutions. There are two types of nucleotide substitutions. A transition is the
replacement of one purine by another purine or one pyrimidine by another pyrimidine. A
transversion is the replacement of a purine by a pyrimidine, or vice versa.
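
The two substitution types defined above can be distinguished mechanically. The following is a minimal sketch for DNA bases only; the function name `classify_substitution` is illustrative and is not part of any method described in this specification.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def classify_substitution(ref: str, alt: str) -> str:
    """Return 'transition' or 'transversion' for a single-base DNA substitution."""
    ref, alt = ref.upper(), alt.upper()
    valid = PURINES | PYRIMIDINES
    if ref not in valid or alt not in valid:
        raise ValueError("bases must be one of A, C, G, T")
    if ref == alt:
        raise ValueError("reference and alternate bases are identical")
    same_class = ({ref, alt} <= PURINES) or ({ref, alt} <= PYRIMIDINES)
    return "transition" if same_class else "transversion"


if __name__ == "__main__":
    for ref, alt in (("C", "T"), ("G", "A"), ("A", "C"), ("G", "T")):
        print(f"{ref}>{alt}: {classify_substitution(ref, alt)}")
```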

Numerous methods exist for detecting SNPs within a nucleotide sequence. A 
review of many of these methods can be found in Landegren et al., 8 Genome Res. 769-76
(1998). For example, a SNP in a genomic sample can be detected by preparing a Reduced
Complexity Genome (RCG) from the genomic sample, then analyzing the RCG for the 
presence or absence of a SNP. See, e.g., WO 00/18960 which is herein incorporated by 
reference in its entirety. Multiple SNPs in a population of target polynucleotides in parallel 
can be detected using, for example, the methods of WO 00/50869 which is herein
incorporated by reference in its entirety. Other SNP detection methods include the
methods of U.S. Pat. Nos. 6,297,018 and 6,322,980 which are herein incorporated by 
reference in their entirety. Furthermore, SNPs can be detected by restriction fragment 
length polymorphism (RFLP) analysis. See, e.g., U.S. Pat. Nos. 5,324,631; 5,645,995 
which are herein incorporated by reference in their entirety. RFLP analysis of SNPs,
however, is limited to cases where the SNP either creates or destroys a restriction enzyme
cleavage site. SNPs can also be detected by direct sequencing of the nucleotide sequence
of interest. In addition, numerous assays based on hybridization, and on mismatch
distinction by polymerases and ligases, have also been developed to detect SNPs. Several web sites
provide information about SNPs including Ensembl on the World Wide Web at
ensemble.org, Sanger Institute on the World Wide Web at sanger.ac.uk/genetics/exon/,
National Center for Biotechnology Information (NCBI) on the World Wide Web at 
ncbi.nlm.nih.gov/SNP/ , The SNP Consortium Ltd. on the World Wide Web at 
snp.cshl.org. The chromosomal locations for the compositions disclosed herein are 
provided below. In addition, one of ordinary skill in the art could use a BLAST against 

25 the genome or any of the databases cited above to find the chromosomal location. 

Another a preferred method to find the genomic coordinates and associated SNPs would 
be to use the BLAT tool (genome.ucsc.edu, Kent et al. 2001, The Human Genome 
Browser at UCSC, Genome Research 996-1006 or Kent 2002 BLAT —The BLAST -Like 
Alignment Tool Genome Reseach, 1-9 ). All web sites above were accessed December 3, 

30 2003. 
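
The RFLP limitation noted above can be illustrated with a minimal Python sketch that
checks whether a single-base substitution creates or destroys a restriction enzyme
recognition site. The function names and the example fragments are hypothetical and are
not sequences of this application; the EcoRI recognition sequence GAATTC is used only
as a familiar example, and only the given strand is scanned (sufficient for a
palindromic site such as this one).

```python
def count_sites(seq: str, site: str) -> int:
    """Count (possibly overlapping) occurrences of a recognition site in seq."""
    seq, site = seq.upper(), site.upper()
    return sum(1 for i in range(len(seq) - len(site) + 1)
               if seq[i:i + len(site)] == site)


def rflp_effect(ref: str, pos: int, alt_base: str, site: str) -> str:
    """Classify a SNP at 0-based position pos by its effect on the site count."""
    alt = ref[:pos] + alt_base + ref[pos + 1:]
    before, after = count_sites(ref, site), count_sites(alt, site)
    if after > before:
        return "creates"
    if after < before:
        return "destroys"
    return "no change"


# Hypothetical reference fragment containing one EcoRI site (GAATTC).
reference = "ACGTGAATTCTTGCA"
print(rflp_effect(reference, 5, "C", "GAATTC"))   # SNP inside the site
print(rflp_effect(reference, 12, "A", "GAATTC"))  # SNP outside the site
```

Only a SNP reported as "creates" or "destroys" would be expected to change the
restriction fragment pattern; a SNP reported as "no change" would need one of the other
detection methods described above.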

RNA interference
RNA interference refers to the process of sequence-specific post-transcriptional
gene silencing in animals mediated by short interfering RNAs (siRNA). Fire et al., 1998,
Nature, 391, 806. The corresponding process in plants is commonly referred to as
post-transcriptional gene silencing or RNA silencing and is also referred to as quelling
in fungi. The process of post-transcriptional gene silencing is thought to be an
evolutionarily conserved cellular defense mechanism used to prevent the expression of
foreign genes which is commonly shared by diverse flora and phyla. Fire et al., 1999,
Trends Genet., 15, 358. Such protection from foreign gene expression may have evolved in
response to the production of double-stranded RNAs (dsRNA) derived from viral infection
or the random integration of transposon elements into a host genome via a cellular
response that specifically destroys homologous single-stranded RNA or viral genomic RNA.
The presence of dsRNA in cells triggers the RNAi response through a mechanism that has
yet to be fully characterized. This mechanism appears to be different from the interferon
response that results from dsRNA-mediated activation of protein kinase PKR and
2',5'-oligoadenylate synthetase, resulting in non-specific cleavage of mRNA by
ribonuclease L.

The presence of long dsRNAs in cells stimulates the activity of a ribonuclease III
enzyme referred to as dicer. Dicer is involved in the processing of the dsRNA into short
pieces of dsRNA known as short interfering RNAs (siRNA). Bernstein et al., 2001,
Nature, 409, 363. Short interfering RNAs derived from dicer activity are typically about
21-23 nucleotides in length and comprise about 19 base pair duplexes. Dicer has also been
implicated in the excision of 21 and 22 nucleotide small temporal RNAs (stRNA) from
precursor RNA of conserved structure that are implicated in translational control.
Hutvagner et al., 2001, Science, 293, 834. The RNAi response also features an
endonuclease complex containing a siRNA, commonly referred to as an RNA-induced
silencing complex (RISC), which mediates cleavage of single-stranded RNA having
sequence complementary to the antisense strand of the siRNA duplex. Cleavage of the
target RNA takes place in the middle of the region complementary to the antisense strand
of the siRNA duplex. Elbashir et al., 2001, Genes Dev., 15, 188.

Short interfering RNA mediated RNAi has been studied in a variety of systems.
Fire et al., 1998, Nature, 391, 806, were the first to observe RNAi in C. elegans. Wianny
and Goetz, 1999, Nature Cell Biol., 2, 70, describe RNAi mediated by dsRNA in mouse
embryos. Hammond et al., 2000, Nature, 404, 293, describe RNAi in Drosophila cells
transfected with dsRNA. Elbashir et al., 2001, Nature, 411, 494, describe RNAi induced
by introduction of duplexes of synthetic 21-nucleotide RNAs in cultured mammalian cells
including human embryonic kidney and HeLa cells. Recent work in Drosophila embryonic
lysates (Elbashir et al., 2001, EMBO J., 20, 6877) has revealed certain requirements for
siRNA length, structure, chemical composition, and sequence that are essential to mediate
efficient RNAi activity. These studies have shown that 21 nucleotide siRNA duplexes are
most active when containing two nucleotide 3'-overhangs. Furthermore, complete
substitution of one or both siRNA strands with 2'-deoxy (2'-H) or 2'-O-methyl nucleotides
abolishes RNAi activity, whereas substitution of the 3'-terminal siRNA overhang
nucleotides with deoxy nucleotides (2'-H) was shown to be tolerated. Single mismatch
sequences in the center of the siRNA duplex were also shown to abolish RNAi activity. In
addition, these studies also indicate that the position of the cleavage site in the target RNA
is defined by the 5'-end of the siRNA guide sequence rather than the 3'-end. Elbashir et
al., 2001, EMBO J., 20, 6877. Other studies have indicated that a 5'-phosphate on the
target-complementary strand of a siRNA duplex is required for siRNA activity and that
ATP is utilized to maintain the 5'-phosphate moiety on the siRNA. Nykanen et al., 2001,
Cell, 107, 309.
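
The duplex geometry described above (21-nucleotide strands forming a 19 base pair
duplex with two-nucleotide 3'-overhangs) can be sketched in Python as follows. This is a
simplified illustration under stated assumptions, not a design method taught by the
cited references: the function names, the default "TT" overhang (standing for two
deoxynucleotides) and the example target are all hypothetical.

```python
RNA_COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement_rna(seq: str) -> str:
    """Return the reverse complement of an RNA sequence (A, C, G, U only)."""
    return "".join(RNA_COMPLEMENT[base] for base in reversed(seq))


def design_sirna(target: str, start: int, core_len: int = 19, overhang: str = "TT"):
    """Return (sense, antisense) strands, each written 5'->3'.

    The core is copied from the target; the antisense (guide) strand is its
    reverse complement. Both strands carry the same 3' overhang, written here
    as 'TT' to stand for two deoxythymidines (an illustrative choice).
    """
    core = target.upper().replace("T", "U")[start:start + core_len]
    if len(core) != core_len:
        raise ValueError("target region runs past the end of the sequence")
    return core + overhang, reverse_complement_rna(core) + overhang


# Hypothetical target fragment; not a sequence from this application.
sense, antisense = design_sirna("AAAAACCCCCGGGGGUUUUAA", start=0)
print(sense)      # 19-nt core plus 2-nt 3' overhang
print(antisense)  # reverse complement of the core plus 2-nt 3' overhang
```

Because the antisense strand is the exact reverse complement of the 19-nucleotide core,
the sketch introduces no central mismatches, which the studies cited above report as
abolishing RNAi activity.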

Studies have shown that replacing the 3'-overhanging segments of a 21-mer siRNA
duplex having 2 nucleotide 3' overhangs with deoxyribonucleotides does not have an
adverse effect on RNAi activity. Replacing up to 4 nucleotides on each end of the siRNA
with deoxyribonucleotides has been reported to be well tolerated whereas complete
substitution with deoxyribonucleotides results in no RNAi activity. Elbashir et al., 2001,
EMBO J., 20, 6877. In addition, Elbashir et al., supra, also report that substitution of
siRNA with 2'-O-methyl nucleotides completely abolishes RNAi activity. Li et al., WO
00/44914, and Beach et al., WO 01/68836 both suggest that siRNA "may include
modifications to either the phosphate-sugar backbone or the nucleoside to include at least
one of a nitrogen or sulfur heteroatom"; however, neither application teaches to what extent
these modifications are tolerated in siRNA molecules nor provides any examples of such
modified siRNA. Kreutzer and Limmer, Canadian Patent Application No. 2,359,180, also
describe certain chemical modifications for use in dsRNA constructs in order to counteract
activation of double-stranded RNA-dependent protein kinase PKR, specifically 2'-amino
or 2'-O-methyl nucleotides, and nucleotides containing a 2'-O or 4'-C methylene bridge.
However, Kreutzer and Limmer similarly fail to show to what extent these modifications
are tolerated in siRNA molecules nor do they provide any examples of such modified 
siRNA. 

Parrish et al., 2000, Molecular Cell, 6, 1077-1087, tested certain chemical
modifications targeting the unc-22 gene in C. elegans using long (>25 nt) siRNA
transcripts. The authors describe the introduction of thiophosphate residues into these
siRNA transcripts by incorporating thiophosphate nucleotide analogs with T7 and T3
RNA polymerase and observed that "RNAs with two [phosphorothioate] modified bases
also had substantial decreases in effectiveness as RNAi triggers; [phosphorothioate]
modification of more than two residues greatly destabilized the RNAs in vitro and we
were not able to assay interference activities." Parrish et al. at 1081. The authors also
tested certain modifications at the 2'-position of the nucleotide sugar in the long siRNA
transcripts and observed that substituting deoxynucleotides for ribonucleotides "produced
a substantial decrease in interference activity", especially in the case of Uridine to
Thymidine and/or Cytidine to deoxy-Cytidine substitutions. Parrish et al. In addition, the
authors tested certain base modifications, including substituting 4-thiouracil,
5-bromouracil, 5-iodouracil, 3-(aminoallyl)uracil for uracil, and inosine for guanosine in
sense and antisense strands of the siRNA, and found that whereas 4-thiouracil and
5-bromouracil were well tolerated, inosine "produced a substantial decrease in
interference activity" when incorporated in either strand. Incorporation of 5-iodouracil and
3-(aminoallyl)uracil in the antisense strand resulted in substantial decrease in RNAi
activity as well.

Beach et al., WO 01/68836, describes specific methods for attenuating gene
expression using endogenously derived dsRNA. Tuschl et al., WO 01/75164, describes a
Drosophila in vitro RNAi system and the use of specific siRNA molecules for certain
functional genomic and certain therapeutic applications; although Tuschl, 2001, Chem.
Biochem., 2, 239-245, doubts that RNAi can be used to cure genetic diseases or viral
infection due "to the danger of activating interferon response". Li et al., WO 00/44914,
describes the use of specific dsRNAs for use in attenuating the expression of certain target
genes. Zernicka-Goetz et al., WO 01/36646, describes certain methods for inhibiting the
expression of particular genes in mammalian cells using certain dsRNA molecules. Fire et
al., WO 99/32619, U.S. Patent No. 6,506,559, the contents of which are hereby
incorporated by reference in their entirety, describes particular methods for introducing
certain dsRNA molecules into cells for use in inhibiting gene expression. Plaetinck et al.,
WO 00/01846, describes certain methods for identifying specific genes responsible for
conferring a particular phenotype in a cell using specific dsRNA molecules. Mello et al.,
WO 01/29058, describes the identification of specific genes involved in dsRNA mediated
RNAi. Deschamps Depaillette et al., International PCT Publication No. WO 99/07409,
describes specific compositions consisting of particular dsRNA molecules combined with
certain anti-viral agents. Driscoll et al., International PCT Publication No. WO 01/49844,
describes specific DNA constructs for use in facilitating gene silencing in targeted
organisms. Parrish et al., 2000, Molecular Cell, 6, 1077-1087, describes specific
chemically modified siRNA constructs targeting the unc-22 gene of C. elegans. Tuschl et
al., International PCT Publication No. WO 02/44321, describe certain synthetic siRNA
constructs.

Methods for Using Nucleic Acid Molecules as Probes and Primers
The isolated nucleic acid molecules of the present invention can be used as
hybridization probes to detect, characterize, and quantify hybridizing nucleic acids in, and
isolate hybridizing nucleic acids from, both genomic and transcript-derived nucleic acid
samples. When free in solution, such probes are typically, but not invariably, detectably
labeled; bound to a substrate, as in a microarray, such probes are typically, but not
invariably, unlabeled.

In one embodiment, the isolated nucleic acid molecules of the present invention
can be used as probes to detect and characterize gross alterations in the gene of a CSNA,
such as deletions, insertions, translocations, and duplications of the CSNA genomic locus
through fluorescence in situ hybridization (FISH) to chromosome spreads. See, e.g.,
Andreeff et al. (eds.), Introduction to Fluorescence In Situ Hybridization: Principles and
Clinical Applications, John Wiley & Sons (1999). The isolated nucleic acid molecules of
the present invention can be used as probes to assess smaller genomic alterations using,
e.g., Southern blot detection of restriction fragment length polymorphisms. The isolated
nucleic acid molecules of the present invention can be used as probes to isolate genomic
clones that include a nucleic acid molecule of the present invention, which thereafter can
be restriction mapped and sequenced to identify deletions, insertions, translocations, and
substitutions (single nucleotide polymorphisms, SNPs) at the sequence level.
Alternatively, detection techniques such as molecular beacons may be used; see Kostrikis
et al., Science 279:1228-1229 (1998).

The isolated nucleic acid molecules of the present invention can also be used as
probes to detect, characterize, and quantify CSNA in, and isolate CSNA from,
transcript-derived nucleic acid samples. In one embodiment, the isolated nucleic acid
molecules of the present invention can be used as hybridization probes to detect,
characterize by length, and quantify mRNA by Northern blot of total or poly-A+-selected
RNA samples. In another embodiment, the isolated nucleic acid molecules of the present
invention can be used as hybridization probes to detect, characterize by location, and
quantify mRNA by in situ hybridization to tissue sections. See, e.g., Schwarzacher et al.,
In Situ Hybridization, Springer-Verlag New York (2000). In another preferred embodiment,
the isolated nucleic acid molecules of the present invention can be used as hybridization
probes to measure the representation of clones in a cDNA library or to isolate hybridizing
nucleic acid molecules from cDNA libraries, permitting sequence level
characterization of mRNAs that hybridize to CSNAs, including, without limitation,
identification of deletions, insertions, substitutions, truncations, alternatively spliced forms
and single nucleotide polymorphisms. In yet another preferred embodiment, the nucleic
acid molecules of the instant invention may be used in microarrays.

All of the aforementioned probe techniques are well within the skill in the art, and
are described at greater length in standard texts such as Sambrook (2001), supra; Ausubel
(1999), supra; and Walker et al. (eds.), The Nucleic Acids Protocols Handbook, Humana
Press (2000).

In another embodiment, a nucleic acid molecule of the invention may be used as a
probe or primer to identify and/or amplify a second nucleic acid molecule that selectively
hybridizes to the nucleic acid molecule of the invention. In this embodiment, it is
preferred that the probe or primer be derived from a nucleic acid molecule encoding a
CSP. More preferably, the probe or primer is derived from a nucleic acid molecule
encoding a polypeptide having an amino acid sequence of SEQ ID NO: 113-259. Also
preferred are probes or primers derived from a CSNA. More preferred are probes or
primers derived from a nucleic acid molecule having a nucleotide sequence of SEQ ID
NO: 1-112.

In general, a probe or primer is at least 10 nucleotides in length, more preferably at
least 12, more preferably at least 14 and even more preferably at least 16 or 17 nucleotides
in length. In an even more preferred embodiment, the probe or primer is at least 18
nucleotides in length, even more preferably at least 20 nucleotides and even more
preferably at least 22 nucleotides in length. Primers and probes may also be longer.
For instance, a probe or primer may be 25 nucleotides in length, or may be 30, 40
or 50 nucleotides in length. Methods of performing nucleic acid hybridization using
oligonucleotide probes are well known in the art. See, e.g., Sambrook et al., 1989, supra,
Chapter 11 and pp. 11.31-11.32 and 11.40-11.44, which describe radiolabeling of short
probes, and pp. 11.45-11.53, which describe hybridization conditions for oligonucleotide
probes, including specific conditions for probe hybridization (pp. 11.50-11.51).
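
By way of illustration only, the length preferences above can be applied
computationally by enumerating every window of a chosen length along a template and
screening the candidates. The following Python sketch assumes a hypothetical GC-content
window of 40-60% and uses the Wallace rule of thumb, Tm = 2(A+T) + 4(G+C), which is a
rough estimate for short oligonucleotides and is not taken from this application; the
function names and the example template are likewise hypothetical.

```python
def wallace_tm(oligo: str) -> int:
    """Rough melting temperature (deg C) by the Wallace rule: 2(A+T) + 4(G+C)."""
    oligo = oligo.upper()
    at = oligo.count("A") + oligo.count("T")
    gc = oligo.count("G") + oligo.count("C")
    return 2 * at + 4 * gc


def candidate_probes(template: str, length: int = 18,
                     min_gc: float = 0.4, max_gc: float = 0.6):
    """List (start, oligo, tm) for every window whose GC fraction is in range."""
    template = template.upper()
    hits = []
    for i in range(len(template) - length + 1):
        oligo = template[i:i + length]
        gc_fraction = (oligo.count("G") + oligo.count("C")) / length
        if min_gc <= gc_fraction <= max_gc:
            hits.append((i, oligo, wallace_tm(oligo)))
    return hits


# Hypothetical template; not a sequence from this application.
for start, oligo, tm in candidate_probes("ATGCATGCATGCATGCATGCAT", length=18):
    print(start, oligo, tm)
```

The default length of 18 reflects one of the preferred minimum lengths recited above;
any of the other recited lengths could be passed instead.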

Methods of performing primer-directed amplification are also well known in the
art. Methods for performing the polymerase chain reaction (PCR) are compiled, inter alia,
in McPherson, PCR Basics: From Background to Bench, Springer Verlag (2000); Innis et
al. (eds.), PCR Applications: Protocols for Functional Genomics, Academic Press (1999);
Gelfand et al. (eds.), PCR Strategies, Academic Press (1998); Newton et al., PCR,
Springer-Verlag New York (1997); Burke (ed.), PCR: Essential Techniques, John Wiley
& Son Ltd (1996); White (ed.), PCR Cloning Protocols: From Molecular Cloning to
Genetic Engineering, Vol. 67, Humana Press (1996); and McPherson et al. (eds.), PCR 2:
A Practical Approach, Oxford University Press, Inc. (1995). Methods for performing
RT-PCR are collected, e.g., in Siebert et al. (eds.), Gene Cloning and Analysis by RT-PCR,
Eaton Publishing Company/BioTechniques Books Division, 1998; and Siebert (ed.), PCR
Technique: RT-PCR, Eaton Publishing Company/BioTechniques Books (1995).

PCR and hybridization methods may be used to identify and/or isolate nucleic acid
molecules of the present invention including allelic variants, homologous nucleic acid
molecules and fragments. PCR and hybridization methods may also be used to identify,
amplify and/or isolate nucleic acid molecules of the present invention that encode
homologous proteins, analogs, fusion proteins or muteins of the invention. Nucleic acid
primers as described herein can be used to prime amplification of nucleic acid molecules
of the invention, using transcript-derived or genomic DNA as the template.
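
Primer-directed amplification as described above can be modeled with a short in silico
sketch in Python: the forward primer matches the template strand as written, the reverse
primer matches the opposite strand, and the predicted amplicon spans both primer sites.
The function names, the exact-match requirement and the example sequences are
assumptions made for illustration; the example primers are deliberately shorter than the
preferred lengths recited above to keep the example compact.

```python
_DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_DNA_COMPLEMENT)[::-1]


def in_silico_pcr(template: str, fwd: str, rev: str):
    """Return the predicted amplicon, or None if either primer has no exact match."""
    template = template.upper()
    start = template.find(fwd.upper())
    if start == -1:
        return None
    # The reverse primer anneals to the written strand's complement, so its
    # reverse complement is searched for downstream of the forward primer.
    rev_site = revcomp(rev)
    end = template.find(rev_site, start + len(fwd))
    if end == -1:
        return None
    return template[start:end + len(rev_site)]


# Hypothetical template and primers; not sequences from this application.
template = "GGATCCATGAAACCCGGGTTTTAGGAATTC"
print(in_silico_pcr(template, fwd="ATGAAACC", rev="CTAAAACC"))
```

A practical tool would also tolerate mismatches and report multiple binding sites; the
sketch reports only the first exact match for each primer.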

These nucleic acid primers can also be used, for example, to prime single base
extension (SBE) for SNP detection (see, e.g., U.S. Pat. No. 6,004,744, the disclosure of
which is incorporated herein by reference in its entirety).

Isothermal amplification approaches, such as rolling circle amplification, are also
now well-described. See, e.g., Schweitzer et al., Curr. Opin. Biotechnol. 12(1): 21-7
(2001); International Patent publications WO 97/19193 and WO 00/15779, and U.S.
Patent Nos. 5,854,033 and 5,714,320, the disclosures of which are incorporated herein by
reference in their entireties. Rolling circle amplification can be combined with other
techniques to facilitate SNP detection. See, e.g., Lizardi et al., Nature Genet. 19(3):
225-32 (1998).

Nucleic acid molecules of the present invention may be bound to a substrate either
covalently or noncovalently. The substrate can be porous or solid, planar or non-planar,
unitary or distributed. The bound nucleic acid molecules may be used as hybridization
probes, and may be labeled or unlabeled. In a preferred embodiment, the bound nucleic
acid molecules are unlabeled.

In one embodiment, the nucleic acid molecule of the present invention is bound to
a porous substrate, e.g., a membrane, typically comprising nitrocellulose, nylon, or
positively charged derivatized nylon. The nucleic acid molecule of the present invention
can be used to detect a hybridizing nucleic acid molecule that is present within a labeled
nucleic acid sample, e.g., a sample of transcript-derived nucleic acids. In another
embodiment, the nucleic acid molecule is bound to a solid substrate, including, without
limitation, glass, amorphous silicon, crystalline silicon or plastics. Examples of plastics
include, without limitation, polymethylacrylic, polyethylene, polypropylene, polyacrylate,
polymethylmethacrylate, polyvinylchloride, polytetrafluoroethylene, polystyrene,
polycarbonate, polyacetal, polysulfone, cellulose acetate, cellulose nitrate, nitrocellulose, or
mixtures thereof. The solid substrate may be any shape, including rectangular, disk-like
and spherical. In a preferred embodiment, the solid substrate is a microscope slide or
slide-shaped substrate.

The nucleic acid molecule of the present invention can be attached covalently to a
surface of the support substrate or applied to a derivatized surface in a chaotropic agent
that facilitates denaturation and adherence by presumed noncovalent interactions, or some
combination thereof. The nucleic acid molecule of the present invention can be bound to a
substrate to which a plurality of other nucleic acids are concurrently bound, hybridization
to each of the plurality of bound nucleic acids being separately detectable. At low density,
e.g., on a porous membrane, these substrate-bound collections are typically denominated
macroarrays; at higher density, typically on a solid support, such as glass, these
substrate-bound collections of plural nucleic acids are colloquially termed microarrays.
As used herein, the term microarray includes arrays of all densities. It is, therefore,
another aspect of the invention to provide microarrays that comprise one or more of the
nucleic acid molecules of the present invention.

In yet another embodiment, the invention is directed to single exon probes based 
on the CSNAs disclosed herein. 

Expression Vectors, Host Cells and Recombinant Methods of Producing Polypeptides

Another aspect of the present invention provides vectors that comprise one or more 
of the isolated nucleic acid molecules of the present invention, and host cells in which 
such vectors have been introduced. 

The vectors can be used, inter alia, for propagating the nucleic acid molecules of
the present invention in host cells (cloning vectors), for shuttling the nucleic acid
molecules of the present invention between host cells derived from disparate organisms
(shuttle vectors), for inserting the nucleic acid molecules of the present invention into host
cell chromosomes (insertion vectors), for expressing sense or antisense RNA transcripts of
the nucleic acid molecules of the present invention in vitro or within a host cell, and for
expressing polypeptides encoded by the nucleic acid molecules of the present invention,
alone or as fusion proteins with heterologous polypeptides (expression vectors). Vectors
are by now well known in the art, and are described, inter alia, in Jones et al. (eds.),
Vectors: Cloning Applications: Essential Techniques (Essential Techniques Series), John
Wiley & Son Ltd. (1998); Jones et al. (eds.), Vectors: Expression Systems: Essential
Techniques (Essential Techniques Series), John Wiley & Son Ltd. (1998); Gacesa et al.,
Vectors: Essential Data, John Wiley & Sons Ltd. (1995); Cid-Arregui (eds.), Viral
Vectors: Basic Science and Gene Therapy, Eaton Publishing Co. (2000); Sambrook
(2001), supra; Ausubel (1999), supra. Furthermore, a variety of vectors are available
commercially. Use of existing vectors and modifications thereof is well within the skill
in the art. Thus, only basic features need be described here.

Nucleic acid sequences may be expressed by operatively linking them to an
expression control sequence in an appropriate expression vector and employing that
expression vector to transform an appropriate unicellular host. Expression control
sequences are sequences that control the transcription, post-transcriptional events and
translation of nucleic acid sequences. Such operative linking of a nucleic acid sequence of
this invention to an expression control sequence, of course, includes, if not already part of
the nucleic acid sequence, the provision of a translation initiation codon, ATG or GTG, in
the correct reading frame upstream of the nucleic acid sequence.
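
The requirement for an in-frame initiation codon can be checked with a small Python
sketch. It is illustrative only: the function name and example sequences are
hypothetical, the insert is assumed to be a coding sequence whose length is a multiple
of three, and a leading GTG is treated as an existing initiation codon even though in a
real sequence it could instead encode an internal valine.

```python
START_CODONS = ("ATG", "GTG")


def ensure_start_codon(cds: str, start: str = "ATG") -> str:
    """Return cds with an in-frame initiation codon at its 5' end.

    Prepending a three-nucleotide codon leaves the downstream reading frame
    unchanged, so the encoded sequence after the new codon is preserved.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of three")
    if start not in START_CODONS:
        raise ValueError("start must be ATG or GTG")
    if cds[:3] in START_CODONS:
        return cds
    return start + cds


# Hypothetical coding fragments; not sequences from this application.
print(ensure_start_codon("GCTAAA"))     # no initiation codon: ATG is prepended
print(ensure_start_codon("ATGGCTAAA"))  # already begins with ATG: unchanged
```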

A wide variety of host/expression vector combinations may be employed in
expressing the nucleic acid sequences of this invention. Useful expression vectors, for
example, may consist of segments of chromosomal, non-chromosomal and synthetic
nucleic acid sequences.

In one embodiment, prokaryotic cells may be used with an appropriate vector.
Prokaryotic host cells are often used for cloning and expression. In a preferred
embodiment, prokaryotic host cells include E. coli, Pseudomonas, Bacillus and
Streptomyces. In a preferred embodiment, bacterial host cells are used to express the
nucleic acid molecules of the instant invention. Useful expression vectors for bacterial
hosts include bacterial plasmids, such as those from E. coli, Bacillus or Streptomyces,
including pBluescript, pGEX-2T, pUC vectors, col E1, pCR1, pBR322, pMB9 and their
derivatives, wider host range plasmids, such as RP4, phage DNAs, e.g., the numerous
derivatives of phage lambda, e.g., NM989, λGT10 and λGT11, and other phages, e.g.,
M13 and filamentous single-stranded phage DNA. Where E. coli is used as host,
selectable markers are, analogously, chosen for selectivity in gram negative bacteria: e.g.,
typical markers confer resistance to antibiotics, such as ampicillin, tetracycline,
chloramphenicol, kanamycin, streptomycin and zeocin; auxotrophic markers can also be
used.

In other embodiments, eukaryotic host cells, such as yeast, insect, mammalian or
plant cells, may be used. Yeast cells, typically S. cerevisiae, are useful for eukaryotic
genetic studies, due to the ease of targeting genetic changes by homologous recombination
and the ability to easily complement genetic defects using recombinantly expressed
proteins. Yeast cells are useful for identifying interacting protein components, e.g.,
through use of a two-hybrid system. In a preferred embodiment, yeast cells are useful for
protein expression. Vectors of the present invention for use in yeast will typically, but not
invariably, contain an origin of replication suitable for use in yeast and a selectable marker
that is functional in yeast. Yeast vectors include Yeast Integrating plasmids (e.g., YIp5)
and Yeast Replicating plasmids (the YRp and YEp series plasmids), Yeast Centromere
plasmids (the YCp series plasmids), Yeast Artificial Chromosomes (YACs) which are
based on yeast linear plasmids, denoted YLp, pGPD-2, 2μ plasmids and derivatives
thereof, and improved shuttle vectors such as those described in Gietz et al., Gene, 74:
527-34 (1988) (YIplac, YEplac and YCplac). Selectable markers in yeast vectors include
a variety of auxotrophic markers, the most common of which are (in Saccharomyces
cerevisiae) URA3, HIS3, LEU2, TRP1 and LYS2, which complement specific
auxotrophic mutations, such as ura3-52, his3-D1, leu2-D1, trp1-D1 and lys2-201.

Insect cells may be chosen for high efficiency protein expression. Where the host
cells are from Spodoptera frugiperda, e.g., Sf9 and Sf21 cell lines, and expresSF™ cells
(Protein Sciences Corp., Meriden, CT, USA), the vector replicative strategy is typically
based upon the baculovirus life cycle. Typically, baculovirus transfer vectors are used to
replace the wild-type AcMNPV polyhedrin gene with a heterologous gene of interest.
Sequences that flank the polyhedrin gene in the wild-type genome are positioned 5' and 3'
of the expression cassette on the transfer vectors. Following co-transfection with
AcMNPV DNA, a homologous recombination event occurs between these sequences
resulting in a recombinant virus carrying the gene of interest and the polyhedrin or p10
promoter. Selection can be based upon visual screening for lacZ fusion activity.

The host cells may also be mammalian cells, which are particularly useful for
expression of proteins intended as pharmaceutical agents, and for screening of potential
agonists and antagonists of a protein or a physiological pathway. Mammalian vectors
intended for autonomous extrachromosomal replication will typically include a viral
origin, such as the SV40 origin (for replication in cell lines expressing the large T-antigen,
such as COS1 and COS7 cells), the papillomavirus origin, or the EBV origin for long term
episomal replication (for use, e.g., in 293-EBNA cells, which constitutively express the
EBV EBNA-1 gene product and adenovirus E1A). Vectors intended for integration, and
thus replication as part of the mammalian chromosome, can, but need not, include an
origin of replication functional in mammalian cells, such as the SV40 origin. Vectors
based upon viruses, such as adenovirus, adeno-associated virus, vaccinia virus, and
various mammalian retroviruses, will typically replicate according to the viral replicative
strategy. Selectable markers for use in mammalian cells include, but are not limited to,
resistance to neomycin (G418), blasticidin, hygromycin and zeocin, and selection based
upon the purine salvage pathway using HAT medium.

Expression in mammalian cells can be achieved using a variety of plasmids,
including pSV2, pBC12BI, and p91023, as well as lytic virus vectors (e.g., vaccinia virus,
adenovirus, and baculovirus), episomal virus vectors (e.g., bovine papillomavirus), and
retroviral vectors (e.g., murine retroviruses). Useful vectors for insect cells include
baculoviral vectors and pVL 941.

Plant cells can also be used for expression, with the vector replicon typically
derived from a plant virus (e.g., cauliflower mosaic virus, CaMV; tobacco mosaic virus,
TMV) and selectable markers chosen for suitability in plants.

It is known that codon usage of different host cells may be different. For example,
a plant cell and a human cell may exhibit a difference in codon preference for encoding a
particular amino acid. As a result, human mRNA may not be efficiently translated in a
plant, bacterial or insect host cell. Therefore, another embodiment of this invention is
directed to codon optimization. The codons of the nucleic acid molecules of the invention
may be modified to resemble, as much as possible, genes naturally contained within the
host cell without altering the amino acid sequence encoded by the nucleic acid molecule.
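
Codon optimization as described above amounts to replacing each codon with a
synonymous codon preferred by the host while leaving the encoded amino acid sequence
unchanged. The following Python sketch illustrates the idea using the standard genetic
code. The preference table passed in the example is hypothetical and is not measured
codon usage data for any host; the function names and example sequence are likewise
assumptions made for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def codon_optimize(cds: str, preferred: dict) -> str:
    """Swap each codon for the host-preferred synonymous codon, if one is given."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of three")
    out = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        new_codon = preferred.get(amino_acid, codon)
        # Reject any table entry that would change the encoded amino acid.
        if CODON_TABLE[new_codon] != amino_acid:
            raise ValueError(f"{new_codon} does not encode {amino_acid}")
        out.append(new_codon)
    return "".join(out)


# Hypothetical host preferences (illustrative only, not real usage data).
host_preference = {"L": "CTG", "K": "AAA", "A": "GCG"}
original = "ATGTTAAAGGCATAA"
optimized = codon_optimize(original, host_preference)
print(optimized)
print(translate(original) == translate(optimized))
```

The check inside the loop enforces the constraint stated above: the nucleotide
sequence may change, but the encoded amino acid sequence may not.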

Any of a wide variety of expression control sequences may be used in these
vectors to express the nucleic acid molecules of this invention. Such useful expression
control sequences include the expression control sequences associated with structural
genes of the foregoing expression vectors. Expression control sequences that control
transcription include, e.g., promoters, enhancers and transcription termination sites.
Expression control sequences in eukaryotic cells that control post-transcriptional events
include splice donor and acceptor sites and sequences that modify the half-life of the
transcribed RNA, e.g., sequences that direct poly(A) addition or binding sites for
RNA-binding proteins. Expression control sequences that control translation include
ribosome binding sites, sequences which direct targeted expression of the polypeptide to
or within particular cellular compartments, and sequences in the 5' and 3' untranslated
regions that modify the rate or efficiency of translation.

Examples of useful expression control sequences for a prokaryote, e.g., E. coli,
will include a promoter, often a phage promoter, such as phage lambda pL promoter, the
trc promoter, a hybrid derived from the trp and lac promoters, the bacteriophage T7
promoter (in E. coli cells engineered to express the T7 polymerase), the TAC or TRC
system, the major operator and promoter regions of phage lambda, the control regions of
fd coat protein, and the araBAD operon. Prokaryotic expression vectors may further
include transcription terminators, such as the aspA terminator, and elements that facilitate
translation, such as a consensus ribosome binding site and translation termination codon,
Schomer et al., Proc. Natl. Acad. Sci. USA 83: 8506-8510 (1986).

Expression control sequences for yeast cells, typically S. cerevisiae, will include a
yeast promoter, such as the CYC1 promoter, the GAL1 promoter, the GAL10 promoter,
ADH1 promoter, the promoters of the yeast α-mating system, or the GPD promoter, and
will typically have elements that facilitate transcription termination, such as the
transcription termination signals from the CYC1 or ADH1 gene.

Expression vectors useful for expressing proteins in mammalian cells will include
a promoter active in mammalian cells. These promoters include, but are not limited to,
those derived from mammalian viruses, such as the enhancer-promoter sequences from the
immediate early gene of the human cytomegalovirus (CMV), the enhancer-promoter
sequences from the Rous sarcoma virus long terminal repeat (RSV LTR), the
enhancer-promoter from SV40 and the early and late promoters of adenovirus. Other
expression control sequences include the promoter for 3-phosphoglycerate kinase or other
glycolytic enzymes, and the promoters of acid phosphatase. Other expression control
sequences include those from the gene comprising the CSNA of interest. Often, expression
is enhanced by incorporation of polyadenylation sites, such as the late SV40
polyadenylation site and the polyadenylation signal and transcription termination
sequences from the bovine growth hormone (BGH) gene, and ribosome binding sites.
Furthermore, vectors can include introns, such as intron II of rabbit β-globin gene and
the SV40 splice elements.

Preferred nucleic acid vectors also include a selectable or amplifiable marker gene
and means for amplifying the copy number of the gene of interest. Such marker genes are
well known in the art. Nucleic acid vectors may also comprise stabilizing sequences (e.g.,
ori- or ARS-like sequences and telomere-like sequences), or may alternatively be designed
to favor directed or non-directed integration into the host cell genome. In a preferred
embodiment, nucleic acid sequences of this invention are inserted in frame into an
expression vector that allows high-level expression of an RNA which encodes a protein
comprising the encoded nucleic acid sequence of interest. Nucleic acid cloning and
sequencing methods are well known to those of skill in the art and are described in an
assortment of laboratory manuals, including Sambrook (1989), supra; Sambrook (2000),
supra; Ausubel (1992), supra; and Ausubel (1999), supra. Product information from
manufacturers of biological, chemical and immunological reagents also provides useful
information.
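The in-frame requirement mentioned above can be illustrated with a minimal Python sketch. It assumes the insert is joined directly after an upstream coding segment of known length; the function name, its parameters and the example sequences are hypothetical and are not taken from this specification.

```python
# Illustrative sketch only: names and example sequences are assumptions.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def keeps_reading_frame(upstream_coding_nt: int, insert: str) -> bool:
    """Return True if `insert` continues the upstream reading frame.

    upstream_coding_nt: number of coding nucleotides in the vector that
        precede the insertion point (e.g., start codon plus any tag).
    insert: DNA sequence of the insert, 5' to 3', coding strand.
    """
    insert = insert.upper()
    # The insert must begin on a codon boundary of the upstream frame.
    if upstream_coding_nt % 3 != 0:
        return False
    # The insert must consist of whole codons so downstream sequence stays in frame.
    if len(insert) % 3 != 0:
        return False
    # An internal in-frame stop codon would truncate the fusion protein.
    codons = [insert[i:i + 3] for i in range(0, len(insert), 3)]
    return not any(codon in STOP_CODONS for codon in codons)
```

For example, `keeps_reading_frame(30, "ATGGCCAAA")` is true, whereas an insert placed after 31 upstream coding nucleotides, or one containing an in-frame TAA codon, is rejected.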

Expression vectors may be either constitutive or inducible. Inducible vectors
either include naturally inducible promoters, such as the trc promoter, which is regulated
by the lac operon, the pL promoter, which is regulated by tryptophan, and the
MMTV-LTR promoter, which is inducible by dexamethasone, or contain synthetic
promoters and/or additional elements that confer inducible control on adjacent promoters.
Examples of inducible synthetic promoters are the hybrid Plac/ara-1 promoter and the
PLtetO-1 promoter. The PLtetO-1 promoter takes advantage of the high expression levels
from the PL promoter of phage lambda, but replaces the lambda repressor sites with two
copies of operator 2 of the Tn10 tetracycline resistance operon, causing this promoter to
be tightly repressed by the Tet repressor protein and induced in response to tetracycline
(Tc) and Tc derivatives such as anhydrotetracycline. Vectors may also be inducible
because they contain hormone response elements, such as the glucocorticoid response
element (GRE) and the estrogen response element (ERE), which can confer hormone
inducibility where vectors are used for expression in cells having the respective hormone
receptors. To reduce background levels of expression, elements responsive to ecdysone,
an insect hormone, can be used instead, with coexpression of the ecdysone receptor.

In one embodiment of the invention, expression vectors can be designed to fuse the
expressed polypeptide to small protein tags that facilitate purification and/or visualization.
Such tags include a polyhistidine tag that facilitates purification of the fusion protein by
immobilized metal affinity chromatography, for example using NiNTA resin (Qiagen Inc.,
Valencia, CA, USA) or TALON™ resin (cobalt immobilized affinity chromatography
medium, Clontech Labs, Palo Alto, CA, USA). The fusion protein can include a
chitin-binding tag and self-excising intein, permitting chitin-based purification with self-removal
of the fused tag (IMPACT™ system, New England Biolabs, Inc., Beverly, MA, USA).
Alternatively, the fusion protein can include a calmodulin-binding peptide tag, permitting
purification by calmodulin affinity resin (Stratagene, La Jolla, CA, USA), or a specifically
excisable fragment of the biotin carboxylase carrier protein, permitting purification of in
vivo biotinylated protein using an avidin resin and subsequent tag removal (Promega,
Madison, WI, USA). As another useful alternative, the polypeptides of the present
invention can be expressed as a fusion to glutathione-S-transferase, the affinity and
specificity of binding to glutathione permitting purification using glutathione affinity
resins, such as Glutathione-Superflow Resin (Clontech Laboratories, Palo Alto, CA,
USA), with subsequent elution with free glutathione. Other tags include, for example, the
Xpress epitope, detectable by anti-Xpress antibody (Invitrogen, Carlsbad, CA, USA), a
myc tag, detectable by anti-myc tag antibody, the V5 epitope, detectable by anti-V5
antibody (Invitrogen, Carlsbad, CA, USA), FLAG® epitope, detectable by anti-FLAG®
antibody (Stratagene, La Jolla, CA, USA), and the HA epitope, detectable by anti-HA
antibody.

For secretion of expressed polypeptides, vectors can include appropriate sequences
that encode secretion signals, such as leader peptides. For example, the pSecTag2 vectors
(Invitrogen, Carlsbad, CA, USA) are 5.2 kb mammalian expression vectors that carry the
secretion signal from the V-J2-C region of the mouse Ig kappa-chain for efficient secretion
of recombinant proteins from a variety of mammalian cell lines.

Expression vectors can also be designed to fuse proteins encoded by the
heterologous nucleic acid insert to polypeptides that are larger than purification and/or
identification tags. Useful protein fusions include those that permit display of the encoded
protein on the surface of a phage or cell, fusions to intrinsically fluorescent proteins, such
as those that have a green fluorescent protein (GFP)-like chromophore, fusions to the IgG
Fc region, and fusions for use in two hybrid systems.

Vectors for phage display fuse the encoded polypeptide to, e.g., the gene III
protein (pIII) or gene VIII protein (pVIII) for display on the surface of filamentous phage,
such as M13. See Barbas et al., Phage Display: A Laboratory Manual, Cold Spring
Harbor Laboratory Press (2001); Kay et al. (eds.), Phage Display of Peptides and Proteins:
A Laboratory Manual, Academic Press, Inc. (1996); Abelson et al. (eds.), Combinatorial
Chemistry (Methods in Enzymology, Vol. 267), Academic Press (1996). Vectors for yeast
display, e.g., the pYD1 yeast display vector (Invitrogen, Carlsbad, CA, USA), use the
a-agglutinin yeast adhesion receptor to display recombinant protein on the surface of S.
cerevisiae. Vectors for mammalian display, e.g., the pDisplay™ vector (Invitrogen,
Carlsbad, CA, USA), target recombinant proteins using an N-terminal cell surface
targeting signal and a C-terminal transmembrane anchoring domain of platelet derived
growth factor receptor.

A wide variety of vectors now exist that fuse proteins encoded by heterologous
nucleic acids to the chromophore of the substrate-independent, intrinsically fluorescent
green fluorescent protein from Aequorea victoria ("GFP") and its variants. The GFP-like
chromophore can be selected from GFP-like chromophores found in naturally occurring
proteins, such as A. victoria GFP (GenBank accession number AAA27721), Renilla
reniformis GFP, FP583 (GenBank accession no. AF168419) (DsRed), FP593 (AF272711),
FP483 (AF168420), FP484 (AF168424), FP595 (AF246709), FP486 (AF168421), FP538
(AF168423), and FP506 (AF168422), and need include only so much of the native protein
as is needed to retain the chromophore's intrinsic fluorescence. Methods for determining
the minimal domain required for fluorescence are known in the art. See Li et al., J. Biol.
Chem. 272: 28545-28549 (1997). Alternatively, the GFP-like chromophore can be
selected from GFP-like chromophores modified from those found in nature. The methods
for engineering such modified GFP-like chromophores and testing them for fluorescence
activity, both alone and as part of protein fusions, are well known in the art. See Heim et
al., Curr. Biol. 6: 178-182 (1996) and Palm et al., Methods Enzymol. 302: 378-394 (1999).
A variety of such modified chromophores are now commercially available and can readily
be used in the fusion proteins of the present invention. These include EGFP ("enhanced
GFP"), EBFP ("enhanced blue fluorescent protein"), BFP2, EYFP ("enhanced yellow
fluorescent protein"), ECFP ("enhanced cyan fluorescent protein") or Citrine. EGFP (see,
e.g., Cormack et al., Gene 173: 33-38 (1996); U.S. Patent Nos. 6,090,919 and 5,804,387,
the disclosures of which are incorporated herein by reference in their entireties) is found
on a variety of vectors, both plasmid and viral, which are available commercially
(Clontech Labs, Palo Alto, CA, USA); EBFP is optimized for expression in mammalian
cells whereas BFP2, which retains the original jellyfish codons, can be expressed in
bacteria (see, e.g., Heim et al., Curr. Biol. 6: 178-182 (1996) and Cormack et al., Gene
173: 33-38 (1996)). Vectors containing these blue-shifted variants are available from
Clontech Labs (Palo Alto, CA, USA). Vectors containing EYFP, ECFP (see, e.g., Heim et
al., Curr. Biol. 6: 178-182 (1996); Miyawaki et al., Nature 388: 882-887 (1997)) and
Citrine (see, e.g., Heikal et al., Proc. Natl. Acad. Sci. USA 97: 11996-12001 (2000)) are
also available from Clontech Labs. The GFP-like chromophore can also be drawn from
other modified GFPs, including those described in U.S. Patent Nos. 6,124,128; 6,096,865;
6,090,919; 6,066,476; 6,054,321; 6,027,881; 5,968,750; 5,874,304; 5,804,387; 5,777,079;
5,741,668; and 5,625,048, the disclosures of which are incorporated herein by reference in
their entireties. See also Conn (ed.), Green Fluorescent Protein (Methods in Enzymology,
Vol. 302), Academic Press, Inc. (1999); Yang et al., J. Biol. Chem. 273: 8212-6 (1998);
Bevis et al., Nature Biotechnology 20: 83-7 (2002). The GFP-like chromophore of each
of these GFP variants can usefully be included in the fusion proteins of the present
invention.

Fusions to the IgG Fc region increase serum half-life of protein pharmaceutical
products through interaction with the FcRn receptor (also denominated the FcRp receptor
and the Brambell receptor, FcRb), further described in International Patent Application
Nos. WO 97/43316, WO 97/34631, WO 96/32478, and WO 96/18412, the disclosures of
which are incorporated herein by reference in their entireties.

For long-term, high-yield recombinant production of the polypeptides of the
present invention, stable expression is preferred. Stable expression is readily achieved by
integration into the host cell genome of vectors having selectable markers, followed by
selection of these integrants. Vectors such as pUB6/V5-His A, B, and C (Invitrogen,
Carlsbad, CA, USA) are designed for high-level stable expression of heterologous proteins
in a wide range of mammalian tissue types and cell lines. pUB6/V5-His uses the
promoter/enhancer sequence from the human ubiquitin C gene to drive expression of
recombinant proteins: expression levels in 293, CHO, and NIH3T3 cells are comparable to
levels from the CMV and human EF-1α promoters. The bsd gene permits rapid selection
of stably transfected mammalian cells with the potent antibiotic blasticidin.

Replication incompetent retroviral vectors, typically derived from Moloney murine
leukemia virus, also are useful for creating stable transfectants having integrated provirus.
The highly efficient transduction machinery of retroviruses, coupled with the availability
of a variety of packaging cell lines such as RetroPack™ PT 67, EcoPack2™-293,
AmphoPack-293, and GP2-293 cell lines (all available from Clontech Laboratories, Palo
Alto, CA, USA), allows a wide host range to be infected with high efficiency; varying the
multiplicity of infection readily adjusts the copy number of the integrated provirus.

Of course, not all vectors and expression control sequences will function equally
well to express the nucleic acid molecules of this invention. Neither will all hosts function
equally well with the same expression system. However, one of skill in the art may make
a selection among these vectors, expression control sequences and hosts without undue
experimentation and without departing from the scope of this invention. For example, in
selecting a vector, the host must be considered because the vector must be replicated in it.
The vector's copy number, the ability to control that copy number, the ability to control
integration, if any, and the expression of any other proteins encoded by the vector, such as
an antibiotic or other selection marker, should also be considered. The present invention
further includes host cells comprising the vectors of the present invention, either present
episomally within the cell or integrated, in whole or in part, into the host cell chromosome.
Among other considerations, some of which are described above, a host cell strain may be
chosen for its ability to process the expressed polypeptide in the desired fashion. Such
post-translational modifications of the polypeptide include, but are not limited to,
acetylation, carboxylation, glycosylation, phosphorylation, lipidation, and acylation, and it
is an aspect of the present invention to provide CSPs with such post-translational
modifications.

In selecting an expression control sequence, a variety of factors should also be
considered. These include, for example, the relative strength of the sequence, its
controllability, and its compatibility with the nucleic acid molecules of this invention,
particularly with regard to potential secondary structures. Unicellular hosts should be
selected by consideration of their compatibility with the chosen vector, the toxicity of the
product coded for by the nucleic acid sequences of this invention, their secretion
characteristics, their ability to fold the polypeptide correctly, their fermentation or culture
requirements, and the ease of purification from them of the products coded for by the
nucleic acid molecules of this invention.

The recombinant nucleic acid molecules and more particularly, the expression
vectors of this invention may be used to express the polypeptides of this invention as
recombinant polypeptides in a heterologous host cell. The polypeptides of this invention
may be full-length or less than full-length polypeptide fragments recombinantly expressed
from the nucleic acid molecules according to this invention. Such polypeptides include
analogs, derivatives and muteins that may or may not have biological activity.

Vectors of the present invention will also often include elements that permit in
vitro transcription of RNA from the inserted heterologous nucleic acid. Such vectors
typically include a phage promoter, such as that from T7, T3, or SP6, flanking the nucleic
acid insert. Often two different such promoters flank the inserted nucleic acid, permitting
separate in vitro production of both sense and antisense strands.
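As an illustrative sketch, the relationship between the sense and antisense transcripts of an insert can be written as a reverse complementation followed by transcription. The function names and the example insert below are hypothetical and serve only to show the sequence relationship; they do not model promoter placement or any particular vector.

```python
# Illustrative sketch only: helper names and the example insert are assumptions.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return dna.upper().translate(_COMPLEMENT)[::-1]


def as_rna(dna: str) -> str:
    """Return the RNA that has the same sequence as the given DNA strand."""
    return dna.upper().replace("T", "U")


def sense_and_antisense(insert_coding_strand: str) -> tuple:
    """Return (sense RNA, antisense RNA) for an insert given 5' to 3'.

    The sense transcript matches the coding strand; the antisense transcript
    matches the opposite strand, i.e. the reverse complement of the insert.
    """
    sense = as_rna(insert_coding_strand)
    antisense = as_rna(reverse_complement(insert_coding_strand))
    return sense, antisense
```

With the invented insert `ATGGCC`, the sense transcript is `AUGGCC` and the antisense transcript is `GGCCAU`.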
Transformation and other methods of introducing nucleic acids into a host cell
(e.g., conjugation, protoplast transformation or fusion, transfection, electroporation,
liposome delivery, membrane fusion techniques, high velocity DNA-coated pellets, viral
infection and protoplast fusion) can be accomplished by a variety of methods which are
well known in the art (See, for instance, Ausubel, supra, and Sambrook et al., supra).

Bacterial, yeast, plant or mammalian cells are transformed or transfected with an
expression vector, such as a plasmid, a cosmid, or the like, wherein the expression vector
comprises the nucleic acid of interest. Alternatively, the cells may be infected by a viral
expression vector comprising the nucleic acid of interest. Depending upon the host cell,
vector, and method of transformation used, transient or stable expression of the
polypeptide will be constitutive or inducible. One having ordinary skill in the art will be
able to decide whether to express a polypeptide transiently or stably, and whether to
express the protein constitutively or inducibly.

A wide variety of unicellular host cells are useful in expressing the DNA
sequences of this invention. These hosts may include well known eukaryotic and
prokaryotic hosts, such as strains of, fungi, yeast, insect cells such as Spodoptera
frugiperda (SF9), animal cells such as CHO, as well as plant cells in tissue culture.
Representative examples of appropriate host cells include, but are not limited to, bacterial
cells, such as E. coli, Caulobacter crescentus, Streptomyces species, and Salmonella
typhimurium; yeast cells, such as Saccharomyces cerevisiae, Schizosaccharomyces pombe,
Pichia pastoris, Pichia methanolica; insect cell lines, such as those from Spodoptera
frugiperda, e.g., Sf9 and Sf21 cell lines, and expresSF™ cells (Protein Sciences Corp.,
Meriden, CT, USA), Drosophila S2 cells, and Trichoplusia ni High Five® Cells
(Invitrogen, Carlsbad, CA, USA); and mammalian cells. Typical mammalian cells include
BHK cells, BSC 1 cells, BSC 40 cells, BMT 10 cells, VERO cells, COS1 cells, COS7
cells, Chinese hamster ovary (CHO) cells, 3T3 cells, NIH 3T3 cells, 293 cells, HEPG2
cells, HeLa cells, L cells, MDCK cells, HEK293 cells, WI38 cells, murine ES cell lines
(e.g., from strains 129/SV, C57/BL6, DBA-1, 129/SVJ), K562 cells, Jurkat cells, and
BW5147 cells. Other mammalian cell lines are well known and readily available from
the American Type Culture Collection (ATCC) (Manassas, VA, USA) and the National
Institute of General Medical Sciences (NIGMS) Human Genetic Cell Repository at the
Coriell Cell Repositories (Camden, NJ, USA). Cells or cell lines derived from colon are
particularly preferred because they may provide a more native post-translational
processing. Particularly preferred are human colon cells.

Particular details of the transfection, expression and purification of recombinant
proteins are well documented and are understood by those of skill in the art. Further
details on the various technical aspects of each of the steps used in recombinant
production of foreign genes in bacterial cell expression systems can be found in a number
of texts and laboratory manuals in the art. See, e.g., Ausubel (1992), supra; Ausubel
(1999), supra; Sambrook (1989), supra; and Sambrook (2001), supra.

Methods for introducing the vectors and nucleic acid molecules of the present
invention into the host cells are well known in the art; the choice of technique will depend
primarily upon the specific vector to be introduced and the host cell chosen.

Nucleic acid molecules and vectors may be introduced into prokaryotes, such as E.
coli, in a number of ways. For instance, phage lambda vectors will typically be packaged
using a packaging extract (e.g., Gigapack® packaging extract, Stratagene, La Jolla, CA,
USA), and the packaged virus used to infect E. coli.

Plasmid vectors will typically be introduced into chemically competent or
electrocompetent bacterial cells. E. coli cells can be rendered chemically competent by
treatment, e.g., with CaCl₂, or a solution of Mg²⁺, Mn²⁺, Ca²⁺, Rb⁺ or K⁺, dimethyl
sulfoxide, dithiothreitol, and hexamine cobalt (III), Hanahan, J. Mol. Biol. 166(4): 557-80
(1983), and vectors introduced by heat shock. A wide variety of chemically competent
strains are also available commercially (e.g., Epicurian Coli® XL10-Gold®
Ultracompetent Cells (Stratagene, La Jolla, CA, USA); DH5α competent cells (Clontech
Laboratories, Palo Alto, CA, USA); and TOP10 Chemically Competent E. coli Kit
(Invitrogen, Carlsbad, CA, USA)). Bacterial cells can be rendered electrocompetent to
take up exogenous DNA by electroporation by various pre-pulse treatments; vectors are
introduced by electroporation followed by subsequent outgrowth in selected media. An
extensive series of protocols is provided by BioRad (Richmond, CA, USA).

Vectors can be introduced into yeast cells by spheroplasting, treatment with
lithium salts, electroporation, or protoplast fusion. Spheroplasts are prepared by the action
of hydrolytic enzymes such as a snail-gut extract, usually denoted Glusulase or
Zymolyase, or an enzyme from Arthrobacter luteus to remove portions of the cell wall in
the presence of osmotic stabilizers, typically 1 M sorbitol. DNA is added to the
spheroplasts, and the mixture is co-precipitated with a solution of polyethylene glycol
(PEG) and Ca²⁺. Subsequently, the cells are resuspended in a solution of sorbitol, mixed
with molten agar and then layered on the surface of a selective plate containing sorbitol.

For lithium-mediated transformation, yeast cells are treated with lithium acetate to
permeabilize the cell wall, DNA is added and the cells are co-precipitated with PEG. The
cells are exposed to a brief heat shock, washed free of PEG and lithium acetate, and
subsequently spread on plates containing ordinary selective medium. Increased
frequencies of transformation are obtained by using specially-prepared single-stranded
carrier DNA and certain organic solvents. Schiestl et al., Curr. Genet. 16(5-6): 339-46
(1989).

For electroporation, freshly-grown yeast cultures are typically washed, suspended
in an osmotic protectant, such as sorbitol, mixed with DNA, and the cell suspension
pulsed in an electroporation device. Subsequently, the cells are spread on the surface of
plates containing selective media. Becker et al., Methods Enzymol. 194: 182-187 (1991).
The efficiency of transformation by electroporation can be increased over 100-fold by
using PEG, single-stranded carrier DNA and cells that are in late log-phase of growth.
Larger constructs, such as YACs, can be introduced by protoplast fusion.

Mammalian and insect cells can be directly infected by packaged viral vectors, or
transfected by chemical or electrical means. For chemical transfection, DNA can be
coprecipitated with CaPO4 or introduced using liposomal and nonliposomal lipid-based
agents. Commercial kits are available for CaPO4 transfection (CalPhos™ Mammalian
Transfection Kit, Clontech Laboratories, Palo Alto, CA, USA), and lipid-mediated
transfection can be practiced using commercial reagents, such as LIPOFECTAMINE™
2000, LIPOFECTAMINE™ Reagent, CELLFECTIN® Reagent, and LIPOFECTIN®
Reagent (Invitrogen, Carlsbad, CA, USA), DOTAP Liposomal Transfection Reagent,
FuGENE 6, X-tremeGENE Q2, DOSPER (Roche Molecular Biochemicals, Indianapolis,
IN, USA), Effectene™, PolyFect®, and Superfect® (Qiagen, Inc., Valencia, CA, USA).
Protocols for electroporating mammalian cells can be found in, for example, Norton et
al. (eds.), Gene Transfer Methods: Introducing DNA into Living Cells and Organisms,
BioTechniques Books, Eaton Publishing Co. (2000). Other transfection techniques
include transfection by particle bombardment and microinjection. See, e.g., Cheng et al.,
Proc. Natl. Acad. Sci. USA 90(10): 4455-9 (1993); Yang et al., Proc. Natl. Acad. Sci. USA
87(24): 9568-72 (1990).

Production of the recombinantly produced proteins of the present invention can 
optionally be followed by purification. 

Purification of recombinantly expressed proteins is now well within the skill in the
art and thus need not be detailed here. See, e.g., Thorner et al. (eds.), Applications of
Chimeric Genes and Hybrid Proteins, Part A: Gene Expression and Protein Purification
(Methods in Enzymology, Vol. 326), Academic Press (2000); Harbin (ed.), Cloning, Gene
Expression and Protein Purification: Experimental Procedures and Process Rationale,
Oxford Univ. Press (2001); Marshak et al., Strategies for Protein Purification and
Characterization: A Laboratory Course Manual, Cold Spring Harbor Laboratory Press
(1996); and Roe (ed.), Protein Purification Applications. Oxford University Press (2001). 

Briefly, however, if purification tags have been fused through use of an expression
vector that appends such tags, purification can be effected, at least in part, by means
appropriate to the tag, such as use of immobilized metal affinity chromatography for
polyhistidine tags. Other techniques common in the art include ammonium sulfate
fractionation, immunoprecipitation, fast protein liquid chromatography (FPLC), high
performance liquid chromatography (HPLC), and preparative gel electrophoresis.

Polypeptides, including Fragments, Muteins, Homologous Proteins, Allelic Variants,
Analogs and Derivatives

Another aspect of the invention relates to polypeptides encoded by the nucleic acid
molecules described herein. In a preferred embodiment, the polypeptide is a colon
specific polypeptide (CSP). In an even more preferred embodiment, the polypeptide
comprises an amino acid sequence of SEQ ID NO: 113-259 or is derived from a
polypeptide having the amino acid sequence of SEQ ID NO: 113-259. A polypeptide as
defined herein may be produced recombinantly, as discussed supra, may be isolated from
a cell that naturally expresses the protein, or may be chemically synthesized following the
teachings of the specification and using methods well known to those having ordinary skill
in the art.

Polypeptides of the present invention may also comprise a part or fragment of a
CSP. In a preferred embodiment, the fragment is derived from a polypeptide having an
amino acid sequence selected from the group consisting of SEQ ID NO: 113-259.
Polypeptides of the present invention comprising a part or fragment of an entire CSP may
or may not be CSPs. For example, a full-length polypeptide may be colon-specific, while
a fragment thereof may be found in other tissues as well as in colon. A polypeptide that is
not a CSP, whether it is a fragment, analog, mutein, homologous protein or derivative, is
nevertheless useful, especially for immunizing animals to prepare anti-CSP antibodies. In
a preferred embodiment, the part or fragment is a CSP. Methods of determining whether a
polypeptide of the present invention is a CSP are described infra.

Polypeptides of the present invention comprising fragments of at least 6
contiguous amino acids are also useful in mapping B cell and T cell epitopes of the
reference protein. See, e.g., Geysen et al., Proc. Natl. Acad. Sci. USA 81: 3998-4002
(1984) and U.S. Patent Nos. 4,708,871 and 5,595,915, the disclosures of which are
incorporated herein by reference in their entireties. Because the fragment need not itself
be immunogenic, part of an immunodominant epitope, nor even recognized by native
antibody, to be useful in such epitope mapping, all fragments of at least 6 amino acids of a
polypeptide of the present invention have utility in such a study.
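The set of contiguous fragments referred to above can be enumerated mechanically. The following Python sketch lists every overlapping fragment of a chosen length from a polypeptide sequence; the function name and the short example sequence are hypothetical and are not drawn from SEQ ID NO: 113-259.

```python
# Illustrative sketch only: the function name and example sequence are assumptions.
from __future__ import annotations


def contiguous_fragments(sequence: str, length: int = 6) -> list[str]:
    """Return all overlapping fragments of `length` contiguous residues.

    A sequence of N residues yields N - length + 1 fragments; a sequence
    shorter than `length` yields an empty list.
    """
    if length < 1:
        raise ValueError("length must be at least 1")
    return [sequence[i:i + length] for i in range(len(sequence) - length + 1)]
```

For an invented 8-residue sequence `MKTAYIAK`, the call `contiguous_fragments("MKTAYIAK")` yields the three 6-mers `MKTAYI`, `KTAYIA` and `TAYIAK`.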

Polypeptides of the present invention comprising fragments of at least 8
contiguous amino acids, often at least 15 contiguous amino acids, are useful as
immunogens for raising antibodies that recognize polypeptides of the present invention.
See, e.g., Lerner, Nature 299: 592-596 (1982); Shinnick et al., Annu. Rev. Microbiol. 37:
425-46 (1983); Sutcliffe et al., Science 219: 660-6 (1983). As further described in the
above-cited references, virtually all 8-mers, conjugated to a carrier, such as a protein,
prove immunogenic and are capable of eliciting antibody for the conjugated peptide;
accordingly, all fragments of at least 8 amino acids of the polypeptides of the present
invention have utility as immunogens.

Polypeptides comprising fragments of at least 8, 9, 10 or 12 contiguous amino
acids are also useful as competitive inhibitors of binding of the entire polypeptide, or a
portion thereof, to antibodies (as in epitope mapping), and to natural binding partners,
such as subunits in a multimeric complex or to receptors or ligands of the subject protein;
this competitive inhibition permits identification and separation of molecules that bind
specifically to the polypeptide of interest. See U.S. Patent Nos. 5,539,084 and 5,783,674,
incorporated herein by reference in their entireties.

The polypeptide of the present invention thus preferably is at least 6 amino acids in
length, typically at least 8, 9, 10 or 12 amino acids in length, and often at least 15 amino
acids in length. Often, the polypeptide of the present invention is at least 20 amino acids
in length, even 25 amino acids, 30 amino acids, 35 amino acids, or 50 amino acids or more
in length. Of course, larger polypeptides having at least 75 amino acids, 100 amino acids,
or even 150 amino acids are also useful, and at times preferred.

One having ordinary skill in the art can produce fragments by truncating the
nucleic acid molecule, e.g., a CSNA, encoding the polypeptide and then expressing it
recombinantly. Alternatively, one can produce a fragment by chemically synthesizing a
portion of the full-length polypeptide. One may also produce a fragment by enzymatically
cleaving either a recombinant polypeptide or an isolated naturally occurring polypeptide.
Methods of producing polypeptide fragments are well known in the art. See, e.g.,
Sambrook (1989), supra; Sambrook (2001), supra; Ausubel (1992), supra; and Ausubel
(1999), supra. In one embodiment, a polypeptide comprising only a fragment, preferably
a fragment of a CSP, may be produced by chemical or enzymatic cleavage of a CSP
polypeptide. In a preferred embodiment, a polypeptide fragment is produced by
expressing a nucleic acid molecule of the present invention encoding a fragment,
preferably of a CSP, in a host cell.

Polypeptides of the present invention are also inclusive of mutants, fusion proteins, 
homologous proteins and allelic variants. 

A mutant protein, or mutein, may have the same or different properties compared
to a naturally occurring polypeptide and comprises at least one amino acid insertion,
duplication, deletion, rearrangement or substitution compared to the amino acid sequence
of a native polypeptide. Small deletions and insertions can often be found that do not alter
the function of a protein. Muteins may or may not be colon-specific. Preferably, the
mutein is colon-specific. More preferably the mutein is a polypeptide that comprises at
least one amino acid insertion, duplication, deletion, rearrangement or substitution
compared to the amino acid sequence of SEQ ID NO: 113-259. Accordingly, in a
preferred embodiment, the mutein is one that exhibits at least 50% sequence identity, more
preferably at least 60% sequence identity, even more preferably at least 70%, yet more
preferably at least 80% sequence identity to a CSP comprising an amino acid sequence of
SEQ ID NO: 113-259. In a yet more preferred embodiment, the mutein exhibits at least
85%, more preferably 90%, even more preferably 95% or 96%, and yet more preferably at
least 97%, 98%, 99% or 99.5% sequence identity to a CSP comprising an amino acid
sequence of SEQ ID NO: 113-259.
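As a rough illustration of how the identity thresholds above might be applied, the following Python sketch computes percent identity between two sequences that have already been aligned to equal length. It is only a sketch under stated assumptions: the alignment step itself is not shown, the gap character `-`, the choice of denominator (aligned columns in which at least one sequence has a residue) and all names are assumptions made here, and the specification's own definition of sequence identity governs.

```python
# Illustrative sketch only: names, gap symbol and denominator are assumptions.
IDENTITY_TIERS = (50.0, 60.0, 70.0, 80.0, 85.0, 90.0,
                  95.0, 96.0, 97.0, 98.0, 99.0, 99.5)


def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over the columns of a pre-computed pairwise alignment."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    matches = 0
    columns = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap and b == gap:
            continue  # a column of two gaps carries no information
        columns += 1
        if a == b:  # a residue opposite a gap counts as a mismatch
            matches += 1
    if columns == 0:
        raise ValueError("alignment contains no residues")
    return 100.0 * matches / columns


def highest_tier_met(identity: float):
    """Return the highest listed threshold that `identity` reaches, or None."""
    met = [tier for tier in IDENTITY_TIERS if identity >= tier]
    return max(met) if met else None
```

For two invented 10-residue sequences differing at one position, `percent_identity("MKTAYIAKQR", "MKTAYLAKQR")` gives 90.0, which meets the 90% tier but not the 95% tier.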

A mutein may be produced by isolation from a naturally occurring mutant cell,
tissue or organism. A mutein may be produced by isolation from a cell, tissue or organism
that has been experimentally mutagenized. Alternatively, a mutein may be produced by
chemical manipulation of a polypeptide, such as by altering an amino acid residue to
another amino acid residue using synthetic or semi-synthetic chemical techniques. In a
preferred embodiment, a mutein is produced from a host cell comprising a mutated nucleic
acid molecule compared to the naturally occurring nucleic acid molecule. For instance,
one may produce a mutein of a polypeptide by introducing one or more mutations into a
nucleic acid molecule of the invention and then expressing it recombinantly. These
mutations may be targeted, in which particular encoded amino acids are altered, or may be
untargeted, in which random encoded amino acids within the polypeptide are altered.
Muteins with random amino acid alterations can be screened for a particular biological
activity or property, particularly whether the polypeptide is colon-specific, as described
below. Multiple random mutations can be introduced into the gene by methods well
known to the art, e.g., by error-prone PCR, shuffling, oligonucleotide-directed
mutagenesis, assembly PCR, sexual PCR mutagenesis, in vivo mutagenesis, cassette
mutagenesis, recursive ensemble mutagenesis, exponential ensemble mutagenesis and
site-specific mutagenesis. Methods of producing muteins with targeted or random amino acid
alterations are well known in the art. See, e.g., Sambrook (1989), supra; Sambrook
(2001), supra; Ausubel (1992), supra; and Ausubel (1999), as well as U.S. Patent No.
5,223,408, which is herein incorporated by reference in its entirety.
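
As a purely illustrative aid, untargeted mutagenesis of a coding sequence can be simulated in silico before any laboratory work. The sketch below assumes uniform random point substitutions at distinct positions; it is not a model of the error spectrum of error-prone PCR or of any other method named above, and the function name and parameters are hypothetical.

```python
import random
from typing import Optional


def random_point_mutations(dna: str, n_mutations: int, seed: Optional[int] = None) -> str:
    """Return a copy of `dna` with `n_mutations` substitutions at distinct random positions."""
    rng = random.Random(seed)  # a seed makes the simulated library reproducible
    seq = list(dna.upper())
    # rng.sample raises ValueError if n_mutations exceeds the sequence length.
    for pos in rng.sample(range(len(seq)), n_mutations):
        # Always substitute a different base so each chosen position really changes.
        seq[pos] = rng.choice([b for b in "ACGT" if b != seq[pos]])
    return "".join(seq)
```

Each simulated variant could then be translated and screened computationally, for instance for loss or gain of a motif of interest, before the corresponding muteins are produced and assayed.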

The invention also contemplates polypeptides that are homologous to a
polypeptide of the invention. In a preferred embodiment, the polypeptide is homologous
to a CSP. In an even more preferred embodiment, the polypeptide is homologous to a
CSP selected from the group having an amino acid sequence of SEQ ID NO: 113-259. By
homologous polypeptide it is meant one that exhibits significant sequence identity to a
CSP, preferably a CSP having an amino acid sequence of SEQ ID NO: 113-259. By
significant sequence identity it is meant that the homologous polypeptide exhibits at least
50% sequence identity, more preferably at least 60% sequence identity, even more
preferably at least 70%, yet more preferably at least 80% sequence identity to a CSP
comprising an amino acid sequence of SEQ ID NO: 113-259. More preferred are
homologous polypeptides exhibiting at least 85%, more preferably 90%, even more
preferably 95% or 96%, and yet more preferably at least 97% or 98% sequence identity to
a CSP comprising an amino acid sequence of SEQ ID NO: 113-259. Most preferably, the
homologous polypeptide exhibits at least 99%, more preferably 99.5%, even more
preferably 99.6%, 99.7%, 99.8% or 99.9% sequence identity to a CSP comprising an
amino acid sequence of SEQ ID NO: 113-259. In a preferred embodiment, the amino acid
substitutions of the homologous polypeptide are conservative amino acid substitutions as
discussed supra.

Homologous polypeptides of the present invention also comprise polypeptides
encoded by a nucleic acid molecule that selectively hybridizes to a CSNA or an antisense
sequence thereof. In this embodiment, it is preferred that the homologous polypeptide be
encoded by a nucleic acid molecule that hybridizes to a CSNA under low stringency,
moderate stringency or high stringency conditions, as defined herein. More preferred is a
homologous polypeptide encoded by a nucleic acid sequence which hybridizes to a CSNA
selected from the group consisting of SEQ ID NO: 1-112 or a homologous polypeptide
encoded by a nucleic acid molecule that hybridizes to a nucleic acid molecule that encodes
a CSP, preferably a CSP of SEQ ID NO: 113-259, under low stringency, moderate
stringency or high stringency conditions, as defined herein.

Homologous polypeptides of the present invention may be naturally occurring and
derived from another species, especially one derived from another primate, such as
chimpanzee, gorilla, rhesus macaque, or baboon, wherein the homologous polypeptide
comprises an amino acid sequence that exhibits significant sequence identity to that of
SEQ ID NO: 113-259. The homologous polypeptide may also be a naturally occurring
polypeptide from a human, when the CSP is a member of a family of polypeptides. The
homologous polypeptide may also be a naturally occurring polypeptide derived from a
non-primate, mammalian species, including without limitation, domesticated species, e.g.,
dog, cat, mouse, rat, rabbit, guinea pig, hamster, cow, horse, goat or pig. The homologous
polypeptide may also be a naturally occurring polypeptide derived from a non-mammalian
species, such as birds or reptiles. The naturally occurring homologous protein may be
isolated directly from humans or other species. Alternatively, the nucleic acid molecule
encoding the naturally occurring homologous polypeptide may be isolated and used to
express the homologous polypeptide recombinantly. The homologous polypeptide may
also be one that is experimentally produced by random mutation of a nucleic acid
molecule and subsequent expression of the nucleic acid molecule. Alternatively, the
homologous polypeptide may be one that is experimentally produced by directed mutation
of one or more codons to alter the encoded amino acid of a CSP. In a preferred
embodiment, the homologous polypeptide encodes a polypeptide that is a CSP.

Relatedness of proteins can also be characterized using a second functional test,
such as the ability of a first protein competitively to inhibit the binding of a second protein
to an antibody. It is, therefore, another aspect of the present invention to provide isolated
polypeptides not only identical in sequence to those described with particularity herein, but
also to provide isolated polypeptides ("cross-reactive proteins") that competitively inhibit
the binding of antibodies to all or to a portion of the isolated polypeptides of the present
invention. Such competitive inhibition can readily be determined using immunoassays
well known in the art.

As discussed above, single nucleotide polymorphisms (SNPs) occur frequently in
eukaryotic genomes, and the sequence determined from one individual of a species may
differ from other allelic forms present within the population. Thus, polypeptides of the
present invention are also inclusive of those encoded by an allelic variant of a nucleic acid
molecule encoding a CSP. In this embodiment, it is preferred that the polypeptide be
encoded by an allelic variant of a gene that encodes a polypeptide having the amino acid
sequence selected from the group consisting of SEQ ID NO: 113-259. More preferred is
that the polypeptide be encoded by an allelic variant of a gene that has the nucleic acid
sequence selected from the group consisting of SEQ ID NO: 1-112.

Polypeptides of the present invention are also inclusive of derivative polypeptides
encoded by a nucleic acid molecule according to the instant invention. In this
embodiment, it is preferred that the polypeptide be a CSP. Also preferred are derivative
polypeptides having an amino acid sequence selected from the group consisting of SEQ
ID NO: 113-259 and which has been acetylated, carboxylated, phosphorylated,
glycosylated, ubiquitinated or post-translationally modified in another manner. In another
preferred embodiment, the derivative has been labeled with, e.g., radioactive isotopes such
as 125I, 32P, 35S, and 3H. In another preferred embodiment, the derivative has been labeled
with fluorophores, chemiluminescent agents, enzymes, and antiligands that can serve as
specific binding pair members for a labeled ligand.

Polypeptide modifications are well known to those of skill and have been
described in great detail in the scientific literature. Several particularly common
modifications, glycosylation, lipid attachment, sulfation, gamma-carboxylation of
glutamic acid residues, hydroxylation and ADP-ribosylation, for instance, are described in
most basic texts, such as, for instance, Creighton, Protein Structure and Molecular
Properties, 2nd ed., W. H. Freeman and Company (1993). Many detailed reviews are
available on this subject, such as, for example, those provided by Wold, in Johnson (ed.),
Posttranslational Covalent Modification of Proteins, pgs. 1-12, Academic Press (1983);
Seifter et al., Meth. Enzymol. 182: 626-646 (1990) and Rattan et al., Ann. N.Y. Acad. Sci.
663: 48-62 (1992).

One may determine whether a polypeptide of the invention is likely to be
post-translationally modified by analyzing the sequence of the polypeptide to determine if there
are peptide motifs indicative of sites for post-translational modification. There are a
number of computer programs that permit prediction of post-translational modifications.
See, e.g., expasy.org (accessed November 11, 2002) of the world wide web, which
includes PSORT, for prediction of protein sorting signals and localization sites, SignalP,
for prediction of signal peptide cleavage sites, MITOPROT and Predotar, for prediction of
mitochondrial targeting sequences, NetOGlyc, for prediction of type O-glycosylation sites
in mammalian proteins, big-PI Predictor and DGPI, for prediction of prenylation-anchor
and cleavage sites, and NetPhos, for prediction of Ser, Thr and Tyr phosphorylation sites
in eukaryotic proteins. Other computer programs, such as those included in GCG, also
may be used to determine post-translational modification peptide motifs.
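
The motif analysis described above can be sketched, in a greatly simplified form, as a pattern search. The example below is only an illustration and does not reproduce the algorithms of the programs named above: it scans for the widely used N-glycosylation consensus sequon Asn-X-Ser/Thr, where X is any residue other than proline. The choice of this particular motif and the function name are assumptions of the example, and the presence of a sequon indicates only a candidate site, not that the site is actually modified.

```python
import re
from typing import List

# Asn, then any residue except Pro, then Ser or Thr. The lookahead lets
# overlapping sequons (e.g. "NNTS") each be reported.
N_GLYC_SEQUON = re.compile(r"(?=N[^P][ST])")


def find_n_glyc_sites(protein: str) -> List[int]:
    """Return 1-based positions of Asn residues that begin an N-X-[S/T] sequon."""
    return [m.start() + 1 for m in N_GLYC_SEQUON.finditer(protein.upper())]


# "NGS" at residue 2 matches, "NPT" at residue 6 is excluded because X is Pro,
# and the overlapping "NNT" and "NTS" sequons at residues 10 and 11 both match.
print(find_n_glyc_sites("MNGSANPTQNNTS"))  # [2, 10, 11]
```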

General examples of types of post-translational modifications include, but are not
limited to: (Z)-dehydrobutyrine; 1-chondroitin sulfate-L-aspartic acid ester;
1'-glycosyl-L-tryptophan; 1'-phospho-L-histidine; 1-thioglycine; 2'-(S-L-cysteinyl)-L-histidine;
2'-[3-carboxamido (trimethylammonio)propyl]-L-histidine; 2'-alpha-mannosyl-L-tryptophan;
2-methyl-L-glutamine; 2-oxobutanoic acid; 2-pyrrolidone carboxylic acid;
3'-(1'-L-histidyl)-L-tyrosine; 3'-(8alpha-FAD)-L-histidine; 3'-(S-L-cysteinyl)-L-tyrosine;
3',3'',5'-triiodo-L-thyronine; 3M'-phospho-L-tyrosine; 3-hydroxy-L-proline;
3'-methyl-L-histidine; 3-methyl-L-lanthionine; 3'-phospho-L-histidine;
4'-(L-tryptophan)-L-tryptophyl quinone; 42
N-cysteinyl-glycosylphosphatidylinositolethanolamine; 43 -(T-L-histidyl)-L-tyrosine;
4-hydroxy-L-arginine; 4-hydroxy-L-lysine; 4-hydroxy-L-proline;
5'-(N6-L-lysine)-L-topaquinone; 5-hydroxy-L-lysine; 5-methyl-L-arginine;
alpha-1-microglobulin-Ig alpha complex chromophore;
bis-L-cysteinyl bis-L-histidino diiron disulfide;
bis-L-cysteinyl-L-N3'-histidino-L-serinyl tetrairon tetrasulfide;
chondroitin sulfate D-glucuronyl-D-galactosyl-D-galactosyl-D-xylosyl-L-serine;
D-alanine; D-allo-isoleucine; D-asparagine; dehydroalanine; dehydrotyrosine;
dermatan 4-sulfate D-glucuronyl-D-galactosyl-D-galactosyl-D-xylosyl-L-serine;
D-glucuronyl-N-glycine; dipyrrolylmethanemethyl-L-cysteine; D-leucine; D-methionine;
D-phenylalanine; D-serine; D-tryptophan; glycine amide; glycine oxazolecarboxylic acid;
glycine thiazolecarboxylic acid; heme P450-bis-L-cysteine-L-tyrosine; heme-bis-L-cysteine;
hemediol-L-aspartyl ester-L-glutamyl ester;
hemediol-L-aspartyl ester-L-glutamyl ester-L-methionine sulfonium; heme-L-cysteine;
heme-L-histidine;
heparan sulfate D-glucuronyl-D-galactosyl-D-galactosyl-D-xylosyl-L-serine;
heme P450-bis-L-cysteine-L-lysine; hexakis-L-cysteinyl hexairon hexasulfide;
keratan sulfate D-glucuronyl-D-galactosyl-D-galactosyl-D-xylosyl-L-threonine;
L oxoalanine-lactic acid; L phenyllactic acid; 1'-(8alpha-FAD)-L-histidine;
L-2',4',5'-topaquinone; L-3',4'-dihydroxyphenylalanine;
L-3',4',5'-trihydroxyphenylalanine; L-4'-bromophenylalanine; L-6-bromotryptophan;
L-alanine amide; L-alanyl imidazolinone glycine; L-allysine; L-arginine amide;
L-asparagine amide; L-aspartic 4-phosphoric anhydride; L-aspartic acid 1-amide;
L-beta-methylthioaspartic acid; L-bromohistidine; L-citrulline; L-cysteine amide;
L-cysteine glutathione disulfide; L-cysteine methyl disulfide;
L-cysteine methyl ester; L-cysteine oxazolecarboxylic acid;
L-cysteine oxazolinecarboxylic acid; L-cysteine persulfide; L-cysteine sulfenic acid;
L-cysteine sulfinic acid; L-cysteine thiazolecarboxylic acid;
L-cysteinyl homocitryl molybdenum-heptairon-nonasulfide;
L-cysteinyl imidazolinone glycine; L-cysteinyl molybdopterin;
L-cysteinyl molybdopterin guanine dinucleotide; L-cystine;
L-erythro-beta-hydroxyasparagine; L-erythro-beta-hydroxyaspartic acid;
L-gamma-carboxyglutamic acid; L-glutamic acid 1-amide; L-glutamic acid 5-methyl ester;
L-glutamine amide; L-glutamyl 5-glycerylphosphorylethanolamine; L-histidine amide;
L-isoglutamyl-polyglutamic acid; L-isoglutamyl-polyglycine; L-isoleucine amide;
L-lanthionine; L-leucine amide; L-lysine amide; L-lysine thiazolecarboxylic acid;
L-lysinoalanine; L-methionine amide; L-methionine sulfone;
L-phenylalanine thiazolecarboxylic acid; L-phenylalanine amide; L-proline amide;
L-selenocysteine; L-selenocysteinyl molybdopterin guanine dinucleotide; L-serine amide;
L-serine thiazolecarboxylic acid; L-seryl imidazolinone glycine;
L-T-bromophenylalanine; L-T-bromophenylalanine; L-threonine amide; L-thyroxine;
L-tryptophan amide; L-tryptophyl quinone; L-tyrosine amide; L-valine amide;
meso-lanthionine; N-(L-glutamyl)-L-tyrosine; N-(L-isoaspartyl)-glycine;
N-(L-isoaspartyl)-L-cysteine; N,N,N-trimethyl-L-alanine; N,N-dimethyl-L-proline;
N2-acetyl-L-lysine; N2-succinyl-L-tryptophan; N4-(ADP-ribosyl)-L-asparagine;
N4-glycosyl-L-asparagine; N4-hydroxymethyl-L-asparagine; N4-methyl-L-asparagine;
N5-methyl-L-glutamine; N6-1-carboxyethyl-L-lysine;
N6-(4-amino hydroxybutyl)-L-lysine; N6-(L-isoglutamyl)-L-lysine;
N6-(phospho-5'-adenosine)-L-lysine; N6-(phospho-5'-guanosine)-L-lysine;
N6,N6,N6-trimethyl-L-lysine; N6,N6-dimethyl-L-lysine; N6-acetyl-L-lysine;
N6-biotinyl-L-lysine; N6-carboxy-L-lysine; N6-formyl-L-lysine; N6-glycyl-L-lysine;
N6-lipoyl-L-lysine; N6-methyl-L-lysine;
N6-methyl-N6-poly(N-methyl-propylamine)-L-lysine; N6-mureinyl-L-lysine;
N6-myristoyl-L-lysine; N6-palmitoyl-L-lysine; N6-pyridoxal phosphate-L-lysine;
N6-pyruvic acid 2-iminyl-L-lysine; N6-retinal-L-lysine; N-acetylglycine;
N-acetyl-L-glutamine; N-acetyl-L-alanine; N-acetyl-L-aspartic acid;
N-acetyl-L-cysteine; N-acetyl-L-glutamic acid; N-acetyl-L-isoleucine;
N-acetyl-L-methionine; N-acetyl-L-proline; N-acetyl-L-serine; N-acetyl-L-threonine;
N-acetyl-L-tyrosine; N-acetyl-L-valine;
N-alanyl-glycosylphosphatidylinositolethanolamine;
N-asparaginyl-glycosylphosphatidylinositolethanolamine;
N-aspartyl-glycosylphosphatidylinositolethanolamine; N-formylglycine;
N-formyl-L-methionine; N-glycyl-glycosylphosphatidylinositolethanolamine;
N-L-glutamyl-poly-L-glutamic acid; N-methylglycine; N-methyl-L-alanine;
N-methyl-L-methionine; N-methyl-L-phenylalanine;
N-myristoyl-glycine; N-palmitoyl-L-cysteine; N-pyruvic acid 2-iminyl-L-cysteine;
N-pyruvic acid 2-iminyl-L-valine; N-seryl-glycosylphosphatidylinositolethanolamine;
N-seryl-glycosylsphingolipidinositolethanolamine; O-(ADP-ribosyl)-L-serine;
O-(phospho-5'-adenosine)-L-threonine; O-(phospho-5'-DNA)-L-serine;
O-(phospho-5'-DNA)-L-threonine; O-(phospho-5'-RNA)-L-serine;
O-(phosphoribosyl dephospho-coenzyme A)-L-serine;
O-(sn-1-glycerophosphoryl)-L-serine; O4'-(8alpha-FAD)-L-tyrosine;
O4'-(phospho-5'-adenosine)-L-tyrosine; O4'-(phospho-5'-DNA)-L-tyrosine;
O4'-(phospho-5'-RNA)-L-tyrosine; O4'-(phospho-5'-uridine)-L-tyrosine;
O4-glycosyl-L-hydroxyproline; O4'-glycosyl-L-tyrosine; O4'-sulfo-L-tyrosine;
O5-glycosyl-L-hydroxylysine; O-glycosyl-L-serine; O-glycosyl-L-threonine;
omega-N-(ADP-ribosyl)-L-arginine; omega-N-omega-N'-dimethyl-L-arginine;
omega-N-methyl-L-arginine; omega-N-omega-N-dimethyl-L-arginine;
omega-N-phospho-L-arginine; O-octanoyl-L-serine; O-palmitoyl-L-serine;
O-palmitoyl-L-threonine; O-phospho-L-serine; O-phospho-L-threonine;
O-phosphopantetheine-L-serine; phycoerythrobilin-bis-L-cysteine;
phycourobilin-bis-L-cysteine; pyrroloquinoline quinone; pyruvic acid;
S hydroxycinnamyl-L-cysteine; S-(2-aminovinyl) methyl-D-cysteine;
S-(2-aminovinyl)-D-cysteine; S-(6-FMN)-L-cysteine; S-(8alpha-FAD)-L-cysteine;
S-(ADP-ribosyl)-L-cysteine; S-(L-isoglutamyl)-L-cysteine;
S-12-hydroxyfarnesyl-L-cysteine; S-acetyl-L-cysteine; S-diacylglycerol-L-cysteine;
S-diphytanylglycerol diether-L-cysteine; S-farnesyl-L-cysteine;
S-geranylgeranyl-L-cysteine; S-glycosyl-L-cysteine; S-glycyl-L-cysteine;
S-methyl-L-cysteine; S-nitrosyl-L-cysteine; S-palmitoyl-L-cysteine;
S-phospho-L-cysteine; S-phycobiliviolin-L-cysteine; S-phycocyanobilin-L-cysteine;
S-phycoerythrobilin-L-cysteine; S-phytochromobilin-L-cysteine; S-selenyl-L-cysteine;
S-sulfo-L-cysteine; tetrakis-L-cysteinyl diiron disulfide;
tetrakis-L-cysteinyl iron; tetrakis-L-cysteinyl tetrairon tetrasulfide;
trans-2,3-cis 4-dihydroxy-L-proline; tris-L-cysteinyl triiron tetrasulfide;
tris-L-cysteinyl triiron trisulfide; tris-L-cysteinyl-L-aspartato tetrairon tetrasulfide;
tris-L-cysteinyl-L-cysteine persulfido-bis-L-glutamato-L-histidino tetrairon disulfide
trioxide; tris-L-cysteinyl-L-N3-histidino tetrairon tetrasulfide;
tris-L-cysteinyl-L-N1'-histidino tetrairon tetrasulfide; and
tris-L-cysteinyl-L-serinyl tetrairon tetrasulfide.

Additional examples of PTMs may be found in web sites such as the Delta Mass
database based on Krishna, R. G. and F. Wold (1998). Posttranslational Modifications.
Proteins - Analysis and Design. R. H. Angeletti. San Diego, Academic Press. 1: 121-206;
Methods in Enzymology, 193, J.A. McCloskey (ed) (1990), pages 647-660; Methods in
Protein Sequence Analysis edited by Kazutomo Imahori and Fumio Sakiyama, Plenum
Press, (1993) "Post-translational modifications of proteins" R.G. Krishna and F. Wold
pages 167-172; "GlycoSuiteDB: a new curated relational database of glycoprotein glycan
structures and their biological sources" Cooper et al. Nucleic Acids Res. 29: 332-335
(2001); "O-GLYCBASE version 4.0: a revised database of O-glycosylated proteins" Gupta
et al. Nucleic Acids Research, 27: 370-372 (1999); and "PhosphoBase, a database of
phosphorylation sites: release 2.0.", Kreegipuu et al. Nucleic Acids Res 27(1): 237-239
(1999); see also WO 02/21139A2, the disclosure of which is incorporated herein by
reference in its entirety.

Tumorigenesis is often accompanied by alterations in the post-translational
modifications of proteins. Thus, in another embodiment, the invention provides
polypeptides from cancerous cells or tissues that have altered post-translational
modifications compared to the post-translational modifications of polypeptides from
normal cells or tissues. A number of altered post-translational modifications are known.
One common alteration is a change in phosphorylation state, wherein the polypeptide from
the cancerous cell or tissue is hyperphosphorylated or hypophosphorylated compared to
the polypeptide from a normal tissue, or wherein the polypeptide is phosphorylated on
different residues than the polypeptide from a normal cell. Another common alteration is
a change in glycosylation state, wherein the polypeptide from the cancerous cell or tissue
has more or less glycosylation than the polypeptide from a normal tissue, and/or wherein
the polypeptide from the cancerous cell or tissue has a different type of glycosylation than
the polypeptide from a noncancerous cell or tissue. Changes in glycosylation may be
critical because carbohydrate-protein and carbohydrate-carbohydrate interactions are
important in cancer cell progression, dissemination and invasion. See, e.g., Barchi, Curr.
Pharm. Des. 6: 485-501 (2000), Verma, Cancer Biochem. Biophys. 14: 151-162 (1994)
and Dennis et al., Bioessays 5: 412-421 (1999).

Another post-translational modification that may be altered in cancer cells is
prenylation. Prenylation is the covalent attachment of a hydrophobic prenyl group (either
farnesyl or geranylgeranyl) to a polypeptide. Prenylation is required for localizing a
protein to a cell membrane and is often required for polypeptide function. For instance,
the Ras superfamily of GTPase signalling proteins must be prenylated for function in a
cell. See, e.g., Prendergast et al., Semin. Cancer Biol. 10: 443-452 (2000) and Khwaja et
al., Lancet 355: 741-744 (2000).

Other post-translational modifications that may be altered in cancer cells include,
without limitation, polypeptide methylation, acetylation, arginylation or racemization of
amino acid residues. In these cases, the polypeptide from the cancerous cell may exhibit
either increased or decreased amounts of the post-translational modification compared to
the corresponding polypeptides from noncancerous cells.

Other polypeptide alterations in cancer cells include abnormal polypeptide
cleavage and aberrant protein-protein interactions. Abnormal polypeptide
cleavage may be cleavage of a polypeptide in a cancerous cell that does not usually occur
in a normal cell, or a lack of cleavage in a cancerous cell, wherein the polypeptide is
cleaved in a normal cell. Aberrant protein-protein interactions may be either covalent
cross-linking or non-covalent binding between proteins that do not normally bind to each
other. Alternatively, in a cancerous cell, a protein may fail to bind to another protein to
which it is bound in a noncancerous cell. Alterations in cleavage or in protein-protein
interactions may be due to over- or underproduction of a polypeptide in a cancerous cell
compared to that in a normal cell, or may be due to alterations in post-translational
modifications (see above) of one or more proteins in the cancerous cell. See, e.g.,
Henschen-Edman, Ann. N.Y. Acad. Sci. 936: 580-593 (2001).

Alterations in polypeptide post-translational modifications, as well as changes in
polypeptide cleavage and protein-protein interactions, may be determined by any method
known in the art. For instance, alterations in phosphorylation may be determined by using
anti-phosphoserine, anti-phosphothreonine or anti-phosphotyrosine antibodies or by amino
acid analysis. Glycosylation alterations may be determined using antibodies specific for
different sugar residues, by carbohydrate sequencing, or by alterations in the size of the
glycoprotein, which can be determined by, e.g., SDS polyacrylamide gel electrophoresis
(PAGE). Other alterations of post-translational modifications, such as prenylation,
racemization, methylation, acetylation and arginylation, may be determined by chemical
analysis, protein sequencing, amino acid analysis, or by using antibodies specific for the
particular post-translational modifications. Changes in protein-protein interactions and in
polypeptide cleavage may be analyzed by any method known in the art including, without
limitation, non-denaturing PAGE (for non-covalent protein-protein interactions), SDS
PAGE (for covalent protein-protein interactions and protein cleavage), chemical cleavage,
protein sequencing or immunoassays.

In another embodiment, the invention provides polypeptides that have been
post-translationally modified. In one embodiment, polypeptides may be modified
enzymatically or chemically, by addition or removal of a post-translational modification.
For example, a polypeptide may be glycosylated or deglycosylated enzymatically.
Similarly, polypeptides may be phosphorylated using a purified kinase, such as a MAP
kinase (e.g., p38, ERK, or JNK) or a tyrosine kinase (e.g., Src or erbB2). A polypeptide
may also be modified through synthetic chemistry. Alternatively, one may isolate the
polypeptide of interest from a cell or tissue that expresses the polypeptide with the desired
post-translational modification. In another embodiment, a nucleic acid molecule encoding
the polypeptide of interest is introduced into a host cell that is capable of
post-translationally modifying the encoded polypeptide in the desired fashion. If the
polypeptide does not contain a motif for a desired post-translational modification, one may
alter the post-translational modification by mutating the nucleic acid sequence of a nucleic
acid molecule encoding the polypeptide so that it contains a site for the desired
post-translational modification. Amino acid sequences that may be post-translationally
modified are known in the art. See, e.g., the programs described above on the website
expasy.org of the world wide web. The nucleic acid molecule may also be introduced into
a host cell that is capable of post-translationally modifying the encoded polypeptide.
Similarly, one may delete sites that are post-translationally modified by either mutating
the nucleic acid sequence so that the encoded polypeptide does not contain the
post-translational modification motif, or by introducing the native nucleic acid molecule into a
host cell that is not capable of post-translationally modifying the encoded polypeptide.
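
The deletion of a modification site by mutation of the coding sequence can be illustrated in silico. The sketch below is an example under stated assumptions rather than a prescribed procedure: it uses the standard genetic code, takes an N-glycosylation sequon as the site to be removed, and replaces the Asn codon with a Gln codon (CAG). The Asn-to-Gln substitution is one commonly used conservative choice and is an assumption of this example, as are the function names.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame, upper-case DNA sequence; trailing partial codons are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def remove_asn_site(dna: str, asn_position: int, replacement_codon: str = "CAG") -> str:
    """Replace the Asn codon at the given 1-based residue position (default: with Gln)."""
    start = 3 * (asn_position - 1)
    if CODON_TABLE.get(dna[start:start + 3]) != "N":
        raise ValueError("residue at the given position is not asparagine")
    return dna[:start] + replacement_codon + dna[start + 3:]


coding = "ATGAACGGCTCTGCT"            # encodes M-N-G-S-A, with a sequon at residue 2
mutant = remove_asn_site(coding, 2)   # Asn2 -> Gln removes the N-X-S motif
print(translate(coding), translate(mutant))  # MNGSA MQGSA
```

The converse operation, introducing a site, would follow the same pattern by substituting codons so that the encoded sequence gains the desired motif.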

It will be appreciated, as is well known and as noted above, that polypeptides are
not always entirely linear. For instance, polypeptides may be branched as a result of
ubiquitination, and they may be circular, with or without branching, generally as a result
of posttranslational events, including natural processing events and events brought about by
human manipulation which do not occur naturally. Circular, branched and branched
circular polypeptides may be synthesized by non-translation natural processes and by
entirely synthetic methods, as well. Modifications can occur anywhere in a polypeptide,
including the peptide backbone, the amino acid side-chains and the amino or carboxyl
termini. In fact, blockage of the amino or carboxyl group in a polypeptide, or both, by a
covalent modification, is common in naturally occurring and synthetic polypeptides and
such modifications may be present in polypeptides of the present invention, as well. For
instance, the amino terminal residue of polypeptides made in E. coli, prior to proteolytic
processing, almost invariably will be N-formylmethionine.

Useful post-synthetic (and post-translational) modifications include conjugation to
detectable labels, such as fluorophores. A wide variety of amine-reactive and thiol-
reactive fluorophore derivatives have been synthesized that react under nondenaturing 
conditions with N-terminal amino groups and epsilon amino groups of lysine residues, on 
the one hand, and with free thiol groups of cysteine residues, on the other. 

Kits are available commercially that permit conjugation of proteins to a variety of
amine-reactive or thiol-reactive fluorophores: Molecular Probes, Inc. (Eugene, OR, USA),
e.g., offers kits for conjugating proteins to Alexa Fluor 350, Alexa Fluor 430,
Fluorescein-EX, Alexa Fluor 488, Oregon Green 488, Alexa Fluor 532, Alexa Fluor 546,
Alexa Fluor 568, Alexa Fluor 594, and Texas Red-X.

A wide variety of other amine-reactive and thiol-reactive fluorophores are
available commercially (Molecular Probes, Inc., Eugene, OR, USA), including Alexa
Fluor® 350, Alexa Fluor® 488, Alexa Fluor® 532, Alexa Fluor® 546, Alexa Fluor® 568,
Alexa Fluor® 594, Alexa Fluor® 647 (monoclonal antibody labeling kits available from
Molecular Probes, Inc., Eugene, OR, USA), BODIPY dyes, such as BODIPY 493/503,
BODIPY FL, BODIPY R6G, BODIPY 530/550, BODIPY TMR, BODIPY 558/568,
BODIPY 564/570, BODIPY 576/589, BODIPY 581/591, BODIPY
TR, BODIPY 630/650, BODIPY 650/665, Cascade Blue, Cascade Yellow, Dansyl,
lissamine rhodamine B, Marina Blue, Oregon Green 488, Oregon Green 514, Pacific Blue,
rhodamine 6G, rhodamine green, rhodamine red, tetramethylrhodamine, Texas Red
(available from Molecular Probes, Inc., Eugene, OR, USA).

The polypeptides of the present invention can also be conjugated to fluorophores,
other proteins, and other macromolecules, using bifunctional linking reagents. Common
homobifunctional reagents include, e.g., APG, AEDP, BASED, BMB, BMDB, BMH,
BMOE, BM[PEO]3, BM[PEO]4, BS3, BSOCOES, DFDNB, DMA, DMP, DMS, DPDPB,
DSG, DSP (Lomant's Reagent), DSS, DST, DTBP, DTME, DTSSP, EGS, HBVS,
Sulfo-BSOCOES, Sulfo-DST, Sulfo-EGS (all available from Pierce, Rockford, IL, USA);
common heterobifunctional cross-linkers include ABH, AMAS, ANB-NOS, APDP,
ASBA, BMPA, BMPH, BMPS, EDC, EMCA, EMCH, EMCS, KMUA, KMUH, GMBS,
LC-SMCC, LC-SPDP, MBS, M2C2H, MPBH, MSA, NHS-ASA, PDPH, PMPI, SADP,
SAED, SAND, SANPAH, SASD, SATP, SBAP, SFAD, SIA, SIAB, SMCC, SMPB,
SMPH, SMPT, SPDP, Sulfo-EMCS, Sulfo-GMBS, Sulfo-HSAB, Sulfo-KMUS,
Sulfo-LC-SPDP, Sulfo-MBS, Sulfo-NHS-LC-ASA, Sulfo-SADP, Sulfo-SANPAH,
Sulfo-SIAB, Sulfo-SMCC, Sulfo-SMPB, Sulfo-LC-SMPT, SVSB, TFCS (all available
from Pierce, Rockford, IL, USA).

Polypeptides of the present invention, including full length polypeptides, 
fragments and fusion proteins, can be conjugated, using such cross-linking reagents, to 
fluorophores that are not amine- or thiol-reactive. Other labels that usefully can be 
conjugated to polypeptides of the present invention include radioactive labels,
echosonographic contrast reagents, and MRI contrast agents.

Polypeptides of the present invention, including full length polypeptides, 
fragments and fusion proteins, can also usefully be conjugated using cross-linking agents 
to carrier proteins, such as KLH, bovine thyroglobulin, and even bovine serum albumin 
(BSA), to increase immunogenicity for raising anti-CSP antibodies. 

Polypeptides of the present invention, including full length polypeptides,
fragments and fusion proteins, can also usefully be conjugated to polyethylene glycol
(PEG); PEGylation increases the serum half life of proteins administered intravenously for
replacement therapy. Delgado et al., Crit. Rev. Ther. Drug Carrier Syst. 9(3-4): 249-304
(1992); Scott et al., Curr. Pharm. Des. 4(6): 423-38 (1998); DeSantis et al., Curr. Opin.
Biotechnol. 10(4): 324-30 (1999). PEG monomers can be attached to the protein directly
or through a linker, with PEGylation using PEG monomers activated with tresyl chloride
(2,2,2-trifluoroethanesulphonyl chloride) permitting direct attachment under mild
conditions.

Polypeptides of the present invention are also inclusive of analogs of a polypeptide
encoded by a nucleic acid molecule according to the instant invention. In a preferred
embodiment, this polypeptide is a CSP. In a more preferred embodiment, this polypeptide
is derived from a polypeptide having part or all of the amino acid sequence of SEQ ID
NO: 113-259. Also preferred is an analog polypeptide comprising one or more
substitutions of non-natural amino acids or non-native inter-residue bonds compared to the
naturally occurring polypeptide. In one embodiment, the analog is structurally similar to a
CSP, but one or more peptide linkages is replaced by a linkage selected from the group
consisting of -CH2NH-, -CH2S-, -CH2-CH2-, -CH=CH- (cis and trans), -COCH2-,
-CH(OH)CH2- and -CH2SO-. In another embodiment, the analog comprises
substitution of one or more amino acids of a CSP with a D-amino acid of the same type or
other non-natural amino acid in order to generate more stable peptides. D-amino acids can
readily be incorporated during chemical peptide synthesis: peptides assembled from
D-amino acids are more resistant to proteolytic attack; incorporation of D-amino acids can
also be used to confer specific three-dimensional conformations on the peptide. Other
amino acid analogues commonly added during chemical synthesis include ornithine,
norleucine, phosphorylated amino acids (typically phosphoserine, phosphothreonine,
phosphotyrosine), L-malonyltyrosine, a non-hydrolyzable analog of phosphotyrosine (see,
e.g., Kole et al., Biochem. Biophys. Res. Com. 209: 817-821 (1995)), and various
halogenated phenylalanine derivatives.

Non-natural amino acids can be incorporated during solid phase chemical synthesis 
or by recombinant techniques, although the former is typically more common. Solid 
phase chemical synthesis of peptides is well established in the art. Procedures are
described, inter alia, in Chan et al. (eds.), Fmoc Solid Phase Peptide Synthesis: A
Practical Approach (Practical Approach Series), Oxford Univ. Press (March 2000); Jones,
Amino Acid and Peptide Synthesis (Oxford Chemistry Primers, No 7), Oxford Univ. Press
(1992); and Bodanszky, Principles of Peptide Synthesis (Springer Laboratory), Springer
Verlag (1993).

Amino acid analogues having detectable labels are also usefully incorporated 
during synthesis to provide derivatives and analogs. Biotin, for example, can be added
using biotinoyl-(9-fluorenylmethoxycarbonyl)-L-lysine (FMOC biocytin) (Molecular
Probes, Eugene, OR, USA). Biotin can also be added enzymatically by incorporation into
a fusion protein of an E. coli BirA substrate peptide. The FMOC and tBOC derivatives of
dabcyl-L-lysine (Molecular Probes, Inc., Eugene, OR, USA) can be used to incorporate
the dabcyl chromophore at selected sites in the peptide sequence during synthesis. The
aminonaphthalene derivative EDANS, the most common fluorophore for pairing with the
dabcyl quencher in fluorescence resonance energy transfer (FRET) systems, can be
introduced during automated synthesis of peptides by using EDANS-FMOC-L-glutamic
acid or the corresponding tBOC derivative (both from Molecular Probes, Inc., Eugene,
OR, USA). Tetramethylrhodamine fluorophores can be incorporated during automated
FMOC synthesis of peptides using (FMOC)-TMR-L-lysine (Molecular Probes, Inc.,
Eugene, OR, USA).

Other useful amino acid analogues that can be incorporated during chemical 
synthesis include aspartic acid, glutamic acid, lysine, and tyrosine analogues having allyl 
side-chain protection (Applied Biosystems, Inc., Foster City, CA, USA); the allyl side 
chain permits synthesis of cyclic, branched-chain, sulfonated, glycosylated, and
phosphorylated peptides.

A large number of other FMOC-protected non-natural amino acid analogues 
capable of incorporation during chemical synthesis are available commercially, including, 
e.g., Fmoc-2-aminobicyclo[2.2.1]heptane-2-carboxylic acid, Fmoc-3-endo-
aminobicyclo[2.2.1]heptane-2-endo-carboxylic acid, Fmoc-3-exo-
aminobicyclo[2.2.1]heptane-2-exo-carboxylic acid, Fmoc-3-endo-amino-
bicyclo[2.2.1]hept-5-ene-2-endo-carboxylic acid, Fmoc-3-exo-amino-bicyclo[2.2.1]hept-
5-ene-2-exo-carboxylic acid, Fmoc-cis-2-amino-1-cyclohexanecarboxylic acid, Fmoc-
trans-2-amino-1-cyclohexanecarboxylic acid, Fmoc-1-amino-1-cyclopentanecarboxylic
acid, Fmoc-cis-2-amino-1-cyclopentanecarboxylic acid, Fmoc-1-amino-1-
cyclopropanecarboxylic acid, Fmoc-D-2-amino-4-(ethylthio)butyric acid, Fmoc-L-2-
amino-4-(ethylthio)butyric acid, Fmoc-L-buthionine, Fmoc-S-methyl-L-cysteine, Fmoc-
2-aminobenzoic acid (anthranilic acid), Fmoc-3-aminobenzoic acid, Fmoc-4-
aminobenzoic acid, Fmoc-2-aminobenzophenone-2'-carboxylic acid, Fmoc-N-(4-
aminobenzoyl)-β-alanine, Fmoc-2-amino-4,5-dimethoxybenzoic acid, Fmoc-4-
aminohippuric acid, Fmoc-2-amino-3-hydroxybenzoic acid, Fmoc-2-amino-5-
hydroxybenzoic acid, Fmoc-3-amino-4-hydroxybenzoic acid, Fmoc-4-amino-3-
hydroxybenzoic acid, Fmoc-4-amino-2-hydroxybenzoic acid, Fmoc-5-amino-2-
hydroxybenzoic acid, Fmoc-2-amino-3-methoxybenzoic acid, Fmoc-4-amino-3-
methoxybenzoic acid, Fmoc-2-amino-3-methylbenzoic acid, Fmoc-2-amino-5-
methylbenzoic acid, Fmoc-2-amino-6-methylbenzoic acid, Fmoc-3-amino-2-
methylbenzoic acid, Fmoc-3-amino-4-methylbenzoic acid, Fmoc-4-amino-3-
methylbenzoic acid, Fmoc-3-amino-2-naphthoic acid, Fmoc-D,L-3-amino-3-
phenylpropionic acid, Fmoc-L-Methyldopa, Fmoc-2-amino-4,6-dimethyl-3-
pyridinecarboxylic acid, Fmoc-D,L-amino-2-thiophenacetic acid, Fmoc-4-
(carboxymethyl)piperazine, Fmoc-4-carboxypiperazine, Fmoc-4-
(carboxymethyl)homopiperazine, Fmoc-4-phenyl-4-piperidinecarboxylic acid, Fmoc-L-
1,2,3,4-tetrahydronorharman-3-carboxylic acid, Fmoc-L-thiazolidine-4-carboxylic acid, all
available from The Peptide Laboratory (Richmond, CA, USA).

Non-natural residues can also be added biosynthetically by engineering a 
suppressor tRNA, typically one that recognizes the UAG stop codon, by chemical 
aminoacylation with the desired unnatural amino acid. Conventional site-directed 
mutagenesis is used to introduce the chosen stop codon UAG at the site of interest in the
protein gene. When the acylated suppressor tRNA and the mutant gene are combined in
an in vitro transcription/translation system, the unnatural amino acid is incorporated in
response to the UAG codon to give a protein containing that amino acid at the specified
position. Liu et al., Proc. Natl. Acad. Sci. USA 96(9): 4780-5 (1999); Wang et al., Science
292(5516): 498-500 (2001). 
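
The codon replacement step of the suppressor tRNA approach described above can be illustrated with a minimal sketch. The function name, the example sequence and the 1-based residue numbering are assumptions made for illustration only; they are not taken from the sequences of the invention, and the sketch models only the in silico design of the mutant coding sequence, not the mutagenesis chemistry.

```python
# Minimal sketch (illustrative assumption): design a mutant coding sequence in
# which the codon for a chosen residue is replaced by the amber stop codon.
AMBER_DNA = "TAG"  # DNA form of the UAG stop codon


def introduce_amber_codon(coding_seq: str, residue_position: int) -> str:
    """Replace the codon at a 1-based residue position with TAG."""
    seq = coding_seq.upper()
    if len(seq) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    n_codons = len(seq) // 3
    if not 1 <= residue_position <= n_codons:
        raise ValueError("residue position lies outside the coding sequence")
    start = 3 * (residue_position - 1)
    return seq[:start] + AMBER_DNA + seq[start + 3:]


# Hypothetical open reading frame: Met-Ala-Lys-Gly-stop
example_orf = "ATGGCTAAAGGTTAA"
print(introduce_amber_codon(example_orf, 3))  # codon 3 (AAA) becomes TAG
```

In such a design the natural termination codon is left unchanged, so that only the chosen internal position is read by the acylated suppressor tRNA.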

Fusion Proteins

Another aspect of the present invention relates to the fusion of a polypeptide of the 
present invention to heterologous polypeptides. In a preferred embodiment, the
polypeptide of the present invention is a CSP. In a more preferred embodiment, the
polypeptide of the present invention that is fused to a heterologous polypeptide
comprises part or all of the amino acid sequence of SEQ ID NO: 113-259, or is a mutein,
homologous polypeptide, analog or derivative thereof. In an even more preferred
embodiment, the fusion protein is encoded by a nucleic acid molecule comprising all or
part of the nucleic acid sequence of SEQ ID NO: 1-112, or comprises all or part of a
nucleic acid sequence that selectively hybridizes or is homologous to a nucleic acid
molecule comprising a nucleic acid sequence of SEQ ID NO: 1-112.

The fusion proteins of the present invention will include at least one fragment of a 
polypeptide of the present invention, which fragment is at least 6, typically at least 8, often 
at least 15, and usefully at least 16, 17, 18, 19, or 20 amino acids long. The fragment of 
the polypeptide of the present invention to be included in the fusion can usefully be at least 25
amino acids long, at least 50 amino acids long, and can be at least 75, 100, or even 150
amino acids long. Fusions that include the entirety of a polypeptide of the present 
invention have particular utility.

The heterologous polypeptide included within the fusion protein of the present
invention is at least 6 amino acids in length, often at least 8 amino acids in length, and
preferably at least 15, 20, or 25 amino acids in length. Fusions that include larger
polypeptides, such as the IgG Fc region, and even entire proteins (such as GFP
chromophore-containing proteins) are particularly useful.

As described above in the description of vectors and expression vectors of the 
present invention, which discussion is incorporated here by reference in its entirety, 
heterologous polypeptides to be included in the fusion proteins of the present invention 
can usefully include those designed to facilitate purification and/or visualization of
recombinantly-expressed proteins. See, e.g., Ausubel, Chapter 16, (1992), supra.
Although purification tags can also be incorporated into fusions that are chemically
synthesized, chemical synthesis typically provides sufficient purity that further
purification by HPLC suffices; however, visualization tags as above described retain their
utility even when the protein is produced by chemical synthesis, and when so included
render the fusion proteins of the present invention useful as directly detectable markers of
the presence of a polypeptide of the invention.

As also discussed above, heterologous polypeptides to be included in the fusion 
proteins of the present invention can usefully include those that facilitate secretion of 
recombinantly expressed proteins into the periplasmic space or extracellular milieu for
prokaryotic hosts or into the culture medium for eukaryotic cells through incorporation of
secretion signals and/or leader sequences. For example, a His6-tagged protein can be
purified on a Ni affinity column and a GST fusion protein can be purified on a glutathione
affinity column. Similarly, a fusion protein comprising the Fc domain of IgG can be
purified on a Protein A or Protein G column and a fusion protein comprising an epitope
tag such as myc can be purified using an immunoaffinity column containing an anti-c-myc
antibody. It is preferable that the epitope tag be separated from the protein encoded by the 
essential gene by an enzymatic cleavage site that can be cleaved after purification. See 
also the discussion of nucleic acid molecules encoding fusion proteins that may be 
expressed on the surface of a cell. 
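
The tag and affinity matrix pairings named in the preceding paragraph can be collected into a simple lookup, sketched below. The dictionary and function names are hypothetical and are introduced only for illustration; the pairings themselves are those stated above.

```python
# Minimal sketch: the tag-to-matrix pairings listed in the text, held in a
# lookup table. Names of the table and helper are illustrative assumptions.
AFFINITY_MATRIX_BY_TAG = {
    "His6": "Ni affinity column",
    "GST": "glutathione affinity column",
    "IgG Fc": "Protein A or Protein G column",
    "myc": "immunoaffinity column containing an anti-c-myc antibody",
}


def purification_matrix(tag: str) -> str:
    """Return the affinity matrix paired with a fusion tag in the text above."""
    try:
        return AFFINITY_MATRIX_BY_TAG[tag]
    except KeyError:
        raise KeyError(f"no affinity matrix recorded for tag {tag!r}") from None


for tag in AFFINITY_MATRIX_BY_TAG:
    print(f"{tag}: {purification_matrix(tag)}")
```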

Other useful fusion proteins of the present invention include those that permit use
of the polypeptide of the present invention as bait in a yeast two-hybrid system. See
Bartel et al. (eds.), The Yeast Two-Hybrid System, Oxford University Press (1997); Zhu
et al., Yeast Hybrid Technologies, Eaton Publishing (2000); Fields et al., Trends Genet.
10(8): 286-92 (1994); Mendelsohn et al., Curr. Opin. Biotechnol. 5(5): 482-6 (1994);
Luban et al., Curr. Opin. Biotechnol. 6(1): 59-64 (1995); Allen et al., Trends Biochem.
Sci. 20(12): 511-6 (1995); Drees, Curr. Opin. Chem. Biol. 3(1): 64-70 (1999); Topcu et
al., Pharm. Res. 17(9): 1049-55 (2000); Fashena et al., Gene 250(1-2): 1-14 (2000); Colas
et al., Nature 380, 548-550 (1996); Norman, T. et al., Science 285, 591-595 (1999);
Fabbrizio et al., Oncogene 18, 4357-4363 (1999); Xu et al., Proc Natl Acad Sci USA
94, 12473-12478 (1997); Yang, et al., Nuc. Acids Res. 23, 1152-1156 (1995); Kolonin et
al., Proc Natl Acad Sci USA 95, 14266-14271 (1998); Cohen et al., Proc Natl Acad Sci
USA 95, 14272-14277 (1998); Uetz, et al., Nature 403, 623-627 (2000); Ito, et al., Proc Natl
Acad Sci USA 98, 4569-4574 (2001). Typically, such fusion is to either E. coli LexA or
yeast GAL4 DNA binding domains. Related bait plasmids are available that express the
bait fused to a nuclear localization signal.

Other useful fusion proteins include those that permit display of the encoded
polypeptide on the surface of a phage or cell, fusions to intrinsically fluorescent proteins,
such as green fluorescent protein (GFP), and fusions to the IgG Fc region, as described
above.

The polypeptides of the present invention can also usefully be fused to protein 
toxins, such as Pseudomonas exotoxin A, diphtheria toxin, Shiga toxin A, anthrax toxin 
lethal factor, or ricin, in order to effect ablation of cells that bind or take up the proteins of
the present invention.

Fusion partners include, inter alia, myc, hemagglutinin (HA), GST,
immunoglobulins, β-galactosidase, biotin, trpE, protein A, β-lactamase, α-amylase,
maltose binding protein, alcohol dehydrogenase, polyhistidine (for example, six histidines
at the amino and/or carboxyl terminus of the polypeptide), lacZ, green fluorescent protein
(GFP), yeast α mating factor, GAL4 transcription activation or DNA binding domain,
luciferase, and serum proteins such as ovalbumin, albumin and the constant domain of
IgG. See, e.g., Ausubel (1992), supra and Ausubel (1999), supra. Fusion proteins may
also contain sites for specific enzymatic cleavage, such as a site that is recognized by
enzymes such as Factor XIII, trypsin, pepsin, or any other enzyme known in the art.
Fusion proteins will typically be made by recombinant nucleic acid methods, as
described above, by chemical synthesis using techniques well known in the art (e.g., a
Merrifield synthesis), or by chemical cross-linking.

Another advantage of fusion proteins is that the epitope tag can be used to bind the
fusion protein to a plate or column through an affinity linkage for screening binding 
proteins or other molecules that bind to the CSP. 

As further described below, the polypeptides of the present invention can readily 
be used as specific immunogens to raise antibodies that specifically recognize
polypeptides of the present invention including CSPs and their allelic variants and
homologues. The antibodies, in turn, can be used, inter alia, specifically to assay for the
polypeptides of the present invention, particularly CSPs, e.g., by ELISA for detection of
protein in fluid samples, such as serum, by immunohistochemistry or laser scanning
cytometry, for detection of protein in tissue samples, or by flow cytometry, for detection
of intracellular protein in cell suspensions, for specific antibody-mediated isolation and/or 
purification of CSPs, as for example by immunoprecipitation, and for use as specific 
agonists or antagonists of CSPs. 

One may determine whether polypeptides of the present invention including CSPs,
muteins, homologous proteins or allelic variants or fusion proteins of the present invention
are functional by methods known in the art. For instance, residues that are tolerant of
change while retaining function can be identified by altering the polypeptide at known
residues using methods known in the art, such as alanine scanning mutagenesis,
Cunningham et al., Science 244(4908): 1081-5 (1989); transposon linker scanning
mutagenesis, Chen et al., Gene 263(1-2): 39-48 (2001); combinations of homolog- and
alanine-scanning mutagenesis, Jin et al., J. Mol. Biol. 226(3): 851-65 (1992); and
combinatorial alanine scanning, Weiss et al., Proc. Natl. Acad. Sci. USA 97(16): 8950-4
(2000), followed by functional assay. Transposon linker scanning kits are available
commercially (New England Biolabs, Beverly, MA, USA, catalog no. E7-102S;
EZ::TN™ In-Frame Linker Insertion Kit, catalogue no. EZI04KN, Epicentre
Technologies Corporation, Madison, WI, USA). 
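
By way of illustration, the set of single-residue variants examined in an alanine scan can be enumerated as in the following minimal sketch. The function name, the one-letter input sequence and the variant labels (original residue, position, A) are assumptions chosen for the example; the sketch only lists the variants and does not represent the mutagenesis or the functional assay.

```python
# Minimal sketch (illustrative assumption): enumerate single alanine
# substitutions of a polypeptide given in one-letter code.
def alanine_scan(sequence, positions=None):
    """Return {label: variant sequence}, one residue replaced by alanine each.

    Positions are 1-based. Residues that are already alanine are skipped,
    since substituting them would not change the sequence.
    """
    seq = sequence.upper()
    if positions is None:
        positions = range(1, len(seq) + 1)
    variants = {}
    for pos in positions:
        if not 1 <= pos <= len(seq):
            raise ValueError(f"position {pos} lies outside the sequence")
        original = seq[pos - 1]
        if original == "A":
            continue
        variants[f"{original}{pos}A"] = seq[:pos - 1] + "A" + seq[pos:]
    return variants


# Hypothetical four-residue peptide used only to show the output format.
for label, variant in alanine_scan("MKAL").items():
    print(label, variant)
```

Each variant would then be expressed and tested in the functional assay, so that positions whose substitution leaves function intact can be recorded as tolerant of change.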

Purification of the polypeptides or fusion proteins of the present invention is well 
known and within the skill of one having ordinary skill in the art. See, e.g., Scopes,
Protein Purification, 2d ed. (1987). Purification of recombinantly expressed polypeptides
is described above. Purification of chemically-synthesized peptides can readily be
effected, e.g., by HPLC.

Accordingly, it is an aspect of the present invention to provide the isolated 
polypeptides or fusion proteins of the present invention in pure or substantially pure form
in the presence or absence of a stabilizing agent. Stabilizing agents include both
proteinaceous and non-proteinaceous material and are well known in the art. Stabilizing
agents, such as albumin and polyethylene glycol (PEG) are known and are commercially
available. 

Although high levels of purity are preferred when the isolated polypeptide or
fusion protein of the present invention is used as a therapeutic agent, such as in vaccines
and replacement therapy, the isolated polypeptides of the present invention are also useful 
at lower purity. For example, partially purified polypeptides of the present invention can 
be used as immunogens to raise antibodies in laboratory animals. 

In a preferred embodiment, the purified and substantially purified polypeptides of
the present invention are in compositions that lack detectable ampholytes, acrylamide
monomers, bis-acrylamide monomers, and polyacrylamide. 

The polypeptides or fusion proteins of the present invention can usefully be 
attached to a substrate. The substrate can be porous or solid, planar or non-planar; the
bond can be covalent or noncovalent. For example, the peptides of the invention may be
stabilized by covalent linkage to albumin. See U.S. Patent No. 5,876,969, the contents of
which are hereby incorporated in their entirety.

The polypeptides or fusion proteins of the present invention can also be usefully 
bound to a porous substrate, commonly a membrane, typically comprising nitrocellulose,
polyvinylidene fluoride (PVDF), or cationically derivatized, hydrophilic PVDF; so bound,
the polypeptides or fusion proteins of the present invention can be used to detect and 
quantify antibodies, e.g. in serum, that bind specifically to the immobilized polypeptide or 
fusion protein of the present invention. 

As another example, the polypeptides or fusion proteins of the present invention
can usefully be bound to a substantially nonporous substrate, such as plastic, to detect and
quantify antibodies, e.g. in serum, that bind specifically to the immobilized protein of the
present invention. Such plastics include polymethylacrylic, polyethylene, polypropylene,
polyacrylate, polymethylmethacrylate, polyvinylchloride, polytetrafluoroethylene,
polystyrene, polycarbonate, polyacetal, polysulfone, cellulose acetate, cellulose nitrate,
nitrocellulose, or mixtures thereof; when the assay is performed in a standard microtiter
dish, the plastic is typically polystyrene. 

The polypeptides and fusion proteins of the present invention can also be attached 
to a substrate suitable for use as a surface enhanced laser desorption ionization source; so
attached, the polypeptide or fusion protein of the present invention is useful for binding
and then detecting secondary proteins that bind with sufficient affinity or avidity to the
surface-bound polypeptide or fusion protein to indicate biologic interaction therebetween.
The polypeptides or fusion proteins of the present invention can also be attached to a
substrate suitable for use in surface plasmon resonance detection; so attached, the
polypeptide or fusion protein of the present invention is useful for binding and then
detecting secondary proteins that bind with sufficient affinity or avidity to the
surface-bound polypeptide or fusion protein to indicate biological interaction therebetween.

Alternative Transcripts 

In another aspect, the present invention provides splice variants of genes and
proteins encoded thereby. A novel splice variant that encodes an
amino acid sequence with a novel region can be targeted for the generation of reagents for
use in detection and/or treatment of cancer. The novel amino acid sequence may lead to a
unique protein structure, protein subcellular localization, biochemical processing or
function of the splice variant. This information can be used to directly or indirectly
facilitate the generation of additional or novel therapeutics or diagnostics. The nucleotide
sequence in this novel splice variant can be used as a nucleic acid probe for the diagnosis 
and/or treatment of cancer. 

Specifically, the newly identified sequences may enable the production of new
antibodies or compounds directed against the novel region for use as a therapeutic or
diagnostic. Alternatively, the newly identified sequences may alter the biochemical or 
biological properties of the encoded protein in such a way as to enable the generation of 
improved or different therapeutics targeting this protein. 

Antibodies 

In another aspect, the invention provides antibodies, including fragments and
derivatives thereof, that bind specifically to polypeptides encoded by the nucleic acid
molecules of the invention. In a preferred embodiment, the antibodies are specific for a
polypeptide that is a CSP, or a fragment, mutein, derivative, analog or fusion protein
thereof. In a more preferred embodiment, the antibodies are specific for a polypeptide that
comprises SEQ ID NO: 113-259, or a fragment, mutein, derivative, analog or fusion
protein thereof.

The antibodies of the present invention can be specific for linear epitopes,
discontinuous epitopes, or conformational epitopes of such proteins or protein fragments, 
either as present on the protein in its native conformation or, in some cases, as present on 
the proteins as denatured, as, e.g., by solubilization in SDS. New epitopes may also be
due to a difference in post translational modifications (PTMs) in disease versus normal
tissue. For example, a particular site on a CSP may be glycosylated in cancerous cells, but
not glycosylated in normal cells or vice versa. In addition, alternative splice forms of a
CSP may be indicative of cancer. Differential degradation of the C or N-terminus of a
CSP may also be a marker or target for anticancer therapy. For example, a CSP may be
N-terminally degraded in cancer cells, exposing new epitopes to antibodies which may
selectively bind for diagnostic or therapeutic uses. 

As is well known in the art, the degree to which an antibody can discriminate 
among molecular species in a mixture will depend, in part, upon the conformational 
relatedness of the species in the mixture; typically, the antibodies of the present invention
will discriminate over adventitious binding to non-CSP polypeptides by at least two-fold,
more typically by at least 5-fold, typically by more than 10-fold, 25-fold, 50-fold, 75-fold,
and often by more than 100-fold, and on occasion by more than 500-fold or 1000-fold.
When used to detect the proteins or protein fragments of the present invention, the
antibody of the present invention is sufficiently specific when it can be used to determine
the presence of the polypeptide of the present invention in samples derived from human
colon. 

Typically, the affinity or avidity of an antibody (or antibody multimer, as in the 
case of an IgM pentamer) of the present invention for a protein or protein fragment of the 
present invention will be at least about 1 x 10⁻⁶ molar (M), typically at least about 5 x 10⁻⁷
M, 1 x 10⁻⁷ M, with affinities and avidities of at least 1 x 10⁻⁸ M, 5 x 10⁻⁹ M, 1 x 10⁻¹⁰ M
and up to 1 x 10⁻¹³ M proving especially useful.
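
To show what the molar values above imply in practice, the following minimal sketch reads each value as an equilibrium dissociation constant (Kd) and applies the simple 1:1 binding relation, fraction bound = [Ab] / ([Ab] + Kd). Both the reading of the values as Kd and the assumption that free antibody is in large excess over antigen are illustrative assumptions, not statements of the specification; the function name and the chosen antibody concentration are likewise hypothetical.

```python
# Minimal sketch (illustrative assumptions): fraction of antigen occupied at
# equilibrium for simple 1:1 binding with antibody in excess.
def fraction_bound(antibody_molar: float, kd_molar: float) -> float:
    """Return [Ab] / ([Ab] + Kd) for a free antibody concentration in mol/L."""
    if antibody_molar < 0:
        raise ValueError("antibody concentration must not be negative")
    if kd_molar <= 0:
        raise ValueError("Kd must be positive")
    return antibody_molar / (antibody_molar + kd_molar)


# Hypothetical free antibody concentration of 10 nM.
antibody = 1e-8
for kd in (1e-6, 1e-7, 1e-8, 1e-10, 1e-13):
    print(f"Kd = {kd:.0e} M: fraction bound = {fraction_bound(antibody, kd):.4f}")
```

Under these assumptions, occupancy is one half when the antibody concentration equals Kd and approaches completeness as Kd falls, which is consistent with the lower molar values being described as especially useful.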

The antibodies of the present invention can be naturally occurring forms, such as 
IgG, IgM, IgD, IgE, IgY, and IgA, from any avian, reptilian, or mammalian species. 

Human antibodies can, but will infrequently, be drawn directly from human donors
or human cells. In such case, antibodies to the polypeptides of the present invention will
typically have resulted from fortuitous immunization, such as autoimmune immunization,
with the polypeptide of the present invention. Such antibodies will typically, but will not
invariably, be polyclonal. In addition, individual polyclonal antibodies may be isolated
and cloned to generate monoclonals. 

Human antibodies are more frequently obtained using transgenic animals that 
express human immunoglobulin genes, which transgenic animals can be affirmatively 
immunized with the protein immunogen of the present invention. Human Ig-transgenic
mice capable of producing human antibodies and methods of producing human antibodies
therefrom upon specific immunization are described, inter alia, in U.S. Patent Nos.
6,162,963; 6,150,584; 6,114,598; 6,075,181; 5,939,598; 5,877,397; 5,874,299; 5,814,318;
5,789,650; 5,770,429; 5,661,016; 5,633,425; 5,625,126; 5,569,825; 5,545,807; 5,545,806;
and 5,591,669, the disclosures of which are incorporated herein by reference in their
entireties. Such antibodies are typically monoclonal, and are typically produced using 
techniques developed for production of murine antibodies. 

Human antibodies are particularly useful, and often preferred, when the antibodies 
of the present invention are to be administered to human beings as in vivo diagnostic or 
therapeutic agents, since recipient immune response to the administered antibody will
often be substantially less than that occasioned by administration of an antibody derived 
from another species, such as mouse. 

IgG, IgM, IgD, IgE, IgY, and IgA antibodies of the present invention are also 
usefully obtained from other species, including mammals such as rodents (typically 
mouse, but also rat, guinea pig, and hamster), lagomorphs (typically rabbits), and also
larger mammals, such as sheep, goats, cows, and horses; or egg laying birds or reptiles
such as chickens or alligators. In such cases, as with the transgenic human-antibody-
producing non-human mammals, fortuitous immunization is not required, and the non-
human mammal is typically affirmatively immunized, according to standard immunization
protocols, with the polypeptide of the present invention. One form of avian antibodies
may be generated using techniques described in WO 00/29444, published 25 May 2000, 
which is herein incorporated by reference in its entirety. 

As discussed above, virtually all fragments of 8 or more contiguous amino acids of 
a polypeptide of the present invention can be used effectively as immunogens when 
conjugated to a carrier, typically a protein such as bovine thyroglobulin, keyhole limpet
hemocyanin, or bovine serum albumin, conveniently using a bifunctional linker such as 
those described elsewhere above, which discussion is incorporated by reference here.

Immunogenicity can also be conferred by fusion of the polypeptides of the present
invention to other moieties. For example, polypeptides of the present invention can be
produced by solid phase synthesis on a branched polylysine core matrix; these multiple
antigenic peptides (MAPs) provide high purity, increased avidity, accurate chemical
definition and improved safety in vaccine development. Tam et al., Proc. Natl. Acad. Sci.
USA 85: 5409-5413 (1988); Posnett et al., J. Biol. Chem. 263: 1719-1725 (1988).
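
Candidate peptide immunogens of the kind discussed above, namely fragments of 8 or more contiguous amino acids, can be enumerated with a sliding window, as in the following minimal sketch. The function name and the ten-residue example sequence are hypothetical and are not drawn from the sequences of the invention; the sketch lists fragments only and makes no prediction of immunogenicity.

```python
# Minimal sketch (illustrative assumption): list every contiguous fragment of
# a chosen length from a polypeptide given in one-letter code.
def contiguous_fragments(sequence: str, length: int = 8) -> list:
    """Return all contiguous fragments of the given length, N- to C-terminal."""
    if length < 1:
        raise ValueError("fragment length must be at least 1")
    seq = sequence.upper()
    # A sequence shorter than the window yields an empty list.
    return [seq[i:i + length] for i in range(len(seq) - length + 1)]


# Hypothetical ten-residue sequence used only to show the output format.
for fragment in contiguous_fragments("MKALVTGSRW", length=8):
    print(fragment)
```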

Protocols for immunizing non-human mammals or avian species are well-established
in the art. See Harlow et al. (eds.), Using Antibodies: A Laboratory Manual,
Cold Spring Harbor Laboratory (1998); Coligan et al. (eds.), Current Protocols in
Immunology, John Wiley & Sons, Inc. (2001); Zola, Monoclonal Antibodies: Preparation
and Use of Monoclonal Antibodies and Engineered Antibody Derivatives (Basics: From
Background to Bench), Springer Verlag (2000); Gross M, Speck J., Dtsch. Tierarztl.
Wochenschr. 103: 417-422 (1996). Immunization protocols often include multiple
immunizations, either with or without adjuvants such as Freund's complete adjuvant and
Freund's incomplete adjuvant, and may include naked DNA immunization. Moss, Semin.
Immunol. 2: 317-327 (1990).

Antibodies from non-human mammals and avian species can be polyclonal or 
monoclonal, with polyclonal antibodies having certain advantages in
immunohistochemical detection of the polypeptides of the present invention and
monoclonal antibodies having advantages in identifying and distinguishing particular
epitopes of the polypeptides of the present invention. Antibodies from avian species may
have particular advantage in detection of the polypeptides of the present invention in
human serum or tissues. Vikinge et al., Biosens. Bioelectron. 13: 1257-1262 (1998).
Following immunization, the antibodies of the present invention can be obtained using any
art-accepted technique. Such techniques are well known in the art and are described in
detail in references such as Coligan, supra; Zola, supra; Howard et al. (eds.), Basic
Methods in Antibody Production and Characterization, CRC Press (2000); Harlow, supra;
Davis (ed.), Monoclonal Antibody Protocols, Vol. 45, Humana Press (1995); Delves (ed.),
Antibody Production: Essential Techniques, John Wiley & Son Ltd (1997); and Kenney,
Antibody Solution: An Antibody Methods Manual, Chapman & Hall (1997).

Briefly, such techniques include, inter alia, production of monoclonal antibodies 
by hybridomas and expression of antibodies or fragments or derivatives thereof from host 
cells engineered to express immunoglobulin genes or fragments thereof. These two
methods of production are not mutually exclusive: genes encoding antibodies specific for
the polypeptides of the present invention can be cloned from hybridomas and thereafter
expressed in other host cells. Nor need the two necessarily be performed together: e.g.,
genes encoding antibodies specific for the polypeptides of the present invention can be
cloned directly from B cells known to be specific for the desired protein, as further
described in U.S. Patent No. 5,627,052, the disclosure of which is incorporated herein by
reference in its entirety, or from antibody-displaying phage. 

Recombinant expression in host cells is particularly useful when fragments or 
derivatives of the antibodies of the present invention are desired. 

Host cells for recombinant production of whole antibodies, antibody
fragments, or antibody derivatives can be prokaryotic or eukaryotic.

Prokaryotic hosts are particularly useful for producing phage displayed antibodies 
of the present invention. 

The technology of phage-displayed antibodies, in which antibody variable region 

fragments are fused, for example, to the gene III protein (pIII) or gene VIII protein (pVIII)
for display on the surface of filamentous phage, such as M13, is by now well-established.
See, e.g., Sidhu, Curr. Opin. Biotechnol. 11(6): 610-6 (2000); Griffiths et al., Curr. Opin.
Biotechnol. 9(1): 102-8 (1998); Hoogenboom et al., Immunotechnology 4(1): 1-20 (1998);
Rader et al., Current Opinion in Biotechnology 8: 503-508 (1997); Aujame et al., Human
Antibodies 8: 155-168 (1997); Hoogenboom, Trends in Biotechnol. 15: 62-70 (1997); de
Kruif et al., 17: 453-455 (1996); Barbas et al., Trends in Biotechnol. 14: 230-234 (1996);
Winter et al., Ann. Rev. Immunol. 433-455 (1994). Techniques and protocols required to
generate, propagate, screen (pan), and use the antibody fragments from such libraries have
recently been compiled. See, e.g., Barbas (2001), supra; Kay, supra; and Abelson, supra.

Typically, phage-displayed antibody fragments are scFv fragments or Fab
fragments; when desired, full length antibodies can be produced by cloning the variable
regions from the displaying phage into a complete antibody and expressing the full length
antibody in a further prokaryotic or a eukaryotic host cell. Eukaryotic cells are also useful
for expression of the antibodies, antibody fragments, and antibody derivatives of the
present invention. For example, antibody fragments of the present invention can be
produced in Pichia pastoris and in Saccharomyces cerevisiae. See, e.g., Takahashi et al.,
Biosci. Biotechnol. Biochem. 64(10): 2138-44 (2000); Freyre et al., J. Biotechnol.
76(2-3): 157-63 (2000); Fischer et al., Biotechnol. Appl. Biochem. 30 (Pt 2): 117-20
(1999); Pennell et al., Res. Immunol. 149(6): 599-603 (1998); Eldin et al., J. Immunol.
Methods 201(1): 67-75 (1997); Frenken et al., Res. Immunol. 149(6): 589-99 (1998); and
Shusta et al., Nature Biotechnol. 16(8): 773-7 (1998).

Antibodies, including antibody fragments and derivatives, of the present invention
can also be produced in insect cells. See, e.g., Li et al., Protein Expr. Purif. 21(1): 121-8
(2001); Ailor et al., Biotechnol. Bioeng. 58(2-3): 196-203 (1998); Hsu et al., Biotechnol.
Prog. 13(1): 96-104 (1997); Edelman et al., Immunology 91(1): 13-9 (1997); and Nesbit et
al., J. Immunol. Methods 151(1-2): 201-8 (1992).

Antibodies and fragments and derivatives thereof of the present invention can also
be produced in plant cells, particularly maize or tobacco. See, e.g., Giddings et al., Nature
Biotechnol. 18(11): 1151-5 (2000); Gavilondo et al., Biotechniques 29(1): 128-38 (2000);
Fischer et al., J. Biol. Regul. Homeost. Agents 14(2): 83-92 (2000); Fischer et al.,
Biotechnol. Appl. Biochem. 30 (Pt 2): 113-6 (1999); Fischer et al., Biol. Chem. 380(7-8):
825-39 (1999); Russell, Curr. Top. Microbiol. Immunol. 240: 119-38 (1999); and Ma et
al., Plant Physiol. 109(2): 341-6 (1995).

Antibodies, including antibody fragments and derivatives, of the present invention
can also be produced in transgenic, non-human, mammalian milk. See, e.g., Pollock et al.,
J. Immunol. Methods 231: 147-57 (1999); Young et al., Res. Immunol. 149: 609-10
(1998); and Limonta et al., Immunotechnology 1: 107-13 (1995).

Mammalian cells useful for recombinant expression of antibodies, antibody
fragments, and antibody derivatives of the present invention include CHO cells, COS
cells, 293 cells, and myeloma cells. Verma et al., J. Immunol. Methods 216(1-2): 165-81
(1998) review and compare bacterial, yeast, insect and mammalian expression systems for
expression of antibodies. Antibodies of the present invention can also be prepared by cell
free translation, as further described in Merk et al., J. Biochem. (Tokyo) 125(2): 328-33
(1999) and Ryabova et al., Nature Biotechnol. 15(1): 79-84 (1997), and in the milk of
transgenic animals, as further described in Pollock et al., J. Immunol. Methods 231(1-2):
147-57 (1999).

The invention further provides antibody fragments that bind specifically to one or
more of the polypeptides of the present invention or to one or more of the polypeptides
encoded by the isolated nucleic acid molecules of the present invention, or the binding of
which can be competitively inhibited by one or more of the polypeptides of the present
invention or one or more of the polypeptides encoded by the isolated nucleic acid




molecules of the present invention. Among such useful fragments are Fab, Fab', Fv,
F(ab')2, and single chain Fv (scFv) fragments. Other useful fragments are described in
Hudson, Curr. Opin. Biotechnol. 9(4): 395-402 (1998).

The present invention also relates to antibody derivatives that bind specifically to
one or more of the polypeptides of the present invention, to one or more of the
polypeptides encoded by the isolated nucleic acid molecules of the present invention, or
the binding of which can be competitively inhibited by one or more of the polypeptides of
the present invention or one or more of the polypeptides encoded by the isolated nucleic
acid molecules of the present invention.

Among such useful derivatives are chimeric, primatized, and humanized
antibodies; such derivatives are less immunogenic in human beings, and thus are more
suitable for in vivo administration, than are unmodified antibodies from non-human
mammalian species. Another useful method is PEGylation to increase the serum half life
of the antibodies.

Chimeric antibodies typically include heavy and/or light chain variable regions
(including both CDR and framework residues) of immunoglobulins of one species,
typically mouse, fused to constant regions of another species, typically human. See, e.g.,
Morrison et al., Proc. Natl. Acad. Sci. USA 81(21): 6851-5 (1984); Sharon et al., Nature
309(5966): 364-7 (1984); Takeda et al., Nature 314(6010): 452-4 (1985); and U.S. Patent
No. 5,807,715, the disclosure of which is incorporated herein by reference in its entirety.
Primatized and humanized antibodies typically include heavy and/or light chain CDRs
from a murine antibody grafted into a non-human primate or human antibody V region
framework, usually further comprising a human constant region. See, e.g., Riechmann et
al., Nature 332(6162): 323-7 (1988); Co et al., Nature 351(6326): 501-2 (1991); and U.S.
Patent Nos. 6,054,297; 5,821,337; 5,770,196; 5,766,886; 5,821,123; 5,869,619; 6,180,377;
6,013,256; 5,693,761; and 6,180,370, the disclosures of which are incorporated herein by
reference in their entireties. Other useful antibody derivatives of the invention include
heteromeric antibody complexes and antibody fusions, such as diabodies (bispecific
antibodies), single-chain diabodies, and intrabodies.

It is contemplated that the nucleic acids encoding the antibodies of the present
invention can be operably joined to other nucleic acids forming a recombinant vector for
cloning or for expression of the antibodies of the invention. Accordingly, the present
invention includes any recombinant vector containing the coding sequences, or part
thereof, whether for eukaryotic transduction, transfection or gene therapy. Such vectors
may be prepared using conventional molecular biology techniques, known to those with
skill in the art, and would comprise DNA encoding sequences for the immunoglobulin
V-regions including framework and CDRs or parts thereof, and a suitable promoter either
with or without a signal sequence for intracellular transport. Such vectors may be
transduced or transfected into eukaryotic cells or used for gene therapy (Marasco et al.,
Proc. Natl. Acad. Sci. USA 90: 7889-7893 (1993); Duan et al., Proc. Natl. Acad. Sci.
USA 91: 5075-5079 (1994)), by conventional techniques, known to those with skill in
the art.

The antibodies of the present invention, including fragments and derivatives
thereof, can usefully be labeled. It is, therefore, another aspect of the present invention to
provide labeled antibodies that bind specifically to one or more of the polypeptides of the
present invention, to one or more of the polypeptides encoded by the isolated nucleic acid
molecules of the present invention, or the binding of which can be competitively inhibited
by one or more of the polypeptides of the present invention or one or more of the
polypeptides encoded by the isolated nucleic acid molecules of the present invention. The
choice of label depends, in part, upon the desired use.

For example, when the antibodies of the present invention are used for
immunohistochemical staining of tissue samples, the label can usefully be an enzyme that
catalyzes production and local deposition of a detectable product. Enzymes typically
conjugated to antibodies to permit their immunohistochemical visualization are well
known, and include alkaline phosphatase, β-galactosidase, glucose oxidase, horseradish
peroxidase (HRP), and urease. Typical substrates for production and deposition of
visually detectable products include o-nitrophenyl-beta-D-galactopyranoside (ONPG);
o-phenylenediamine dihydrochloride (OPD); p-nitrophenyl phosphate (PNPP);
p-nitrophenyl-beta-D-galactopyranoside (PNPG); 3,3'-diaminobenzidine (DAB);
3-amino-9-ethylcarbazole (AEC); 4-chloro-1-naphthol (CN);
5-bromo-4-chloro-3-indolyl-phosphate (BCIP); ABTS®; BluoGal; iodonitrotetrazolium
(INT); nitroblue tetrazolium chloride (NBT); phenazine methosulfate (PMS);
phenolphthalein monophosphate (PMP); tetramethyl benzidine (TMB); tetranitroblue
tetrazolium (TNBT); X-Gal; X-Gluc; and X-Glucoside.

Other substrates can be used to produce products for local deposition that are
luminescent. For example, in the presence of hydrogen peroxide (H2O2), horseradish
peroxidase (HRP) can catalyze the oxidation of cyclic diacylhydrazides, such as luminol.
Immediately following the oxidation, the luminol is in an excited state (intermediate
reaction product), which decays to the ground state by emitting light. Strong enhancement
of the light emission is produced by enhancers, such as phenolic compounds. Advantages
include high sensitivity, high resolution, and rapid detection without radioactivity and
requiring only small amounts of antibody. See, e.g., Thorpe et al., Methods Enzymol. 133:
331-53 (1986); Kricka et al., J. Immunoassay 17(1): 67-83 (1996); and Lundqvist et al., J.
Biolumin. Chemilumin. 10(6): 353-9 (1995). Kits for such enhanced chemiluminescent
detection (ECL) are available commercially. The antibodies can also be labeled using
colloidal gold.

As another example, when the antibodies of the present invention are used, e.g., for
flow cytometric detection, for scanning laser cytometric detection, or for fluorescent
immunoassay, they can usefully be labeled with fluorophores. There are a wide variety of
fluorophore labels that can usefully be attached to the antibodies of the present invention.
For flow cytometric applications, both for extracellular detection and for intracellular
detection, common useful fluorophores can be fluorescein isothiocyanate (FITC),
allophycocyanin (APC), R-phycoerythrin (PE), peridinin chlorophyll protein (PerCP),
Texas Red, Cy3, Cy5, and fluorescence resonance energy tandem fluorophores such as
PerCP-Cy5.5, PE-Cy5, PE-Cy5.5, PE-Cy7, PE-Texas Red, and APC-Cy7.

Other fluorophores include, inter alia, Alexa Fluor® 350, Alexa Fluor® 488,
Alexa Fluor® 532, Alexa Fluor® 546, Alexa Fluor® 568, Alexa Fluor® 594, Alexa
Fluor® 647 (monoclonal antibody labeling kits available from Molecular Probes, Inc.,
Eugene, OR, USA), BODIPY dyes, such as BODIPY 493/503, BODIPY FL, BODIPY
R6G, BODIPY 530/550, BODIPY TMR, BODIPY 558/568, BODIPY
564/570, BODIPY 576/589, BODIPY 581/591, BODIPY TR, BODIPY 630/650,
BODIPY 650/665, Cascade Blue, Cascade Yellow, Dansyl, lissamine rhodamine B,
Marina Blue, Oregon Green 488, Oregon Green 514, Pacific Blue, rhodamine 6G,
rhodamine green, rhodamine red, tetramethylrhodamine, Texas Red (available from
Molecular Probes, Inc., Eugene, OR, USA), and Cy2, Cy3, Cy3.5, Cy5, Cy5.5, Cy7, all of
which are also useful for fluorescently labeling the antibodies of the present invention.
For secondary detection using labeled avidin, streptavidin, captavidin or neutravidin, the
antibodies of the present invention can usefully be labeled with biotin.
When the antibodies of the present invention are used, e.g., for western blotting
applications, they can usefully be labeled with radioisotopes, such as 33P, 32P, 35S, 3H, and
125I. As another example, when the antibodies of the present invention are used for
radioimmunotherapy, the label can usefully be 228Th, 227Ac, 225Ac, 223Ra, 213Bi, 212Pb,
212Bi, 211At, 203Pb, 194Os, 188Re, 186Re, 153Sm, 149Tb, 131I, 125I, 111In, 105Rh, 99mTc,
97Ru, 90Y, 90Sr, 88Y, 72Se, 67Cu, or 47Sc.

As another example, when the antibodies of the present invention are to be used
for in vivo diagnostic use, they can be rendered detectable by conjugation to MRI contrast
agents, such as gadolinium diethylenetriaminepentaacetic acid (DTPA), Lauffer et al.,
Radiology 207(2): 529-38 (1998), or by radioisotopic labeling.

As would be understood, use of the labels described above is not restricted to the
application for which they were mentioned.

The antibodies of the present invention, including fragments and derivatives
thereof, can also be conjugated to toxins, in order to target the toxin's ablative action to
cells that display and/or express the polypeptides of the present invention. Commonly, the
antibody in such immunotoxins is conjugated to Pseudomonas exotoxin A, diphtheria
toxin, shiga toxin A, anthrax toxin lethal factor, or ricin. See Hall (ed.), Immunotoxin
Methods and Protocols (Methods in Molecular Biology, vol. 166), Humana Press (2000);
and Frankel et al. (eds.), Clinical Applications of Immunotoxins, Springer-Verlag (1998).

The antibodies of the present invention can usefully be attached to a substrate, and
it is, therefore, another aspect of the invention to provide antibodies that bind specifically
to one or more of the polypeptides of the present invention, to one or more of the
polypeptides encoded by the isolated nucleic acid molecules of the present invention, or
the binding of which can be competitively inhibited by one or more of the polypeptides of
the present invention or one or more of the polypeptides encoded by the isolated nucleic
acid molecules of the present invention, attached to a substrate. Substrates can be porous
or nonporous, planar or nonplanar. For example, the antibodies of the present invention
can usefully be conjugated to filtration media, such as NHS-activated Sepharose or
CNBr-activated Sepharose, for purposes of immunoaffinity chromatography. As another
example, the antibodies of the present invention can usefully be attached to paramagnetic
microspheres, typically by biotin-streptavidin interaction, which microspheres can then be
used for isolation of cells that express or display the polypeptides of the present invention.
As a further example, the antibodies of the present invention can usefully be attached to
the surface of a microtiter plate for ELISA.

As noted above, the antibodies of the present invention can be produced in
prokaryotic and eukaryotic cells. It is, therefore, another aspect of the present invention to
provide cells that express the antibodies of the present invention, including hybridoma
cells, B cells, plasma cells, and host cells recombinantly modified to express the
antibodies of the present invention.

In yet a further aspect, the present invention provides aptamers evolved to bind
specifically to one or more of the CSPs of the present invention or to polypeptides
encoded by the CSNAs of the invention.

In sum, one of skill in the art, provided with the teachings of this invention, has
available a variety of methods which may be used to alter the biological properties of the
antibodies of this invention including methods which would increase or decrease the
stability or half-life, immunogenicity, toxicity, affinity or yield of a given antibody
molecule, or to alter it in any other way that may render it more suitable for a particular
application.

Transgenic Animals and Cells

In another aspect, the invention provides transgenic cells and non-human
organisms comprising nucleic acid molecules of the invention. In a preferred
embodiment, the transgenic cells and non-human organisms comprise a nucleic acid
molecule encoding a CSP. In a preferred embodiment, the CSP comprises an amino acid
sequence selected from SEQ ID NO: 113-259, or a fragment, mutein, homologous protein
or allelic variant thereof. In another preferred embodiment, the transgenic cells and
non-human organisms comprise a CSNA of the invention, preferably a CSNA comprising a
nucleotide sequence selected from the group consisting of SEQ ID NO: 1-112, or a part,
substantially similar nucleic acid molecule, allelic variant or hybridizing nucleic acid
molecule thereof.

In another embodiment, the transgenic cells and non-human organisms have a
targeted disruption or replacement of the endogenous orthologue of the human CSG. The
transgenic cells can be embryonic stem cells or somatic cells. The transgenic non-human
organisms can be chimeric, nonchimeric heterozygotes, and nonchimeric homozygotes.
Methods of producing transgenic animals are well known in the art. See, e.g., Hogan et
al., Manipulating the Mouse Embryo: A Laboratory Manual, 2d ed., Cold Spring Harbor
Press (1999); Jackson et al., Mouse Genetics and Transgenics: A Practical Approach,
Oxford University Press (2000); and Pinkert, Transgenic Animal Technology: A
Laboratory Handbook, Academic Press (1999).
Any technique known in the art may be used to introduce a nucleic acid molecule
of the invention into an animal to produce the founder lines of transgenic animals. Such
techniques include, but are not limited to, pronuclear microinjection (see, e.g., Paterson
et al., Appl. Microbiol. Biotechnol. 40: 691-698 (1994); Carver et al., Biotechnology 11:
1263-1270 (1993); Wright et al., Biotechnology 9: 830-834 (1991); and U.S. Patent No.
4,873,191, herein incorporated by reference in its entirety); retrovirus-mediated gene
transfer into germ lines, blastocysts or embryos (see, e.g., Van der Putten et al., Proc.
Natl. Acad. Sci. USA 82: 6148-6152 (1985)); gene targeting in embryonic stem cells (see,
e.g., Thompson et al., Cell 56: 313-321 (1989)); electroporation of cells or embryos (see,
e.g., Lo, Mol. Cell. Biol. 3: 1803-1814 (1983)); introduction using a gene gun (see,
e.g., Ulmer et al., Science 259: 1745-49 (1993)); introducing nucleic acid constructs into
embryonic pluripotent stem cells and transferring the stem cells back into the blastocyst;
and sperm-mediated gene transfer (see, e.g., Lavitrano et al., Cell 57: 717-723 (1989)).

Other techniques include, for example, nuclear transfer into enucleated oocytes of
nuclei from cultured embryonic, fetal, or adult cells induced to quiescence (see, e.g.,
Campbell et al., Nature 380: 64-66 (1996); Wilmut et al., Nature 385: 810-813 (1997)).
The present invention provides for transgenic animals that carry the transgene (i.e., a
nucleic acid molecule of the invention) in all their cells, as well as animals which carry the
transgene in some, but not all their cells, i.e., mosaic animals or chimeric animals.

The transgene may be integrated as a single transgene or as multiple copies, such
as in concatamers, e.g., head-to-head tandems or head-to-tail tandems. The transgene
may also be selectively introduced into and activated in a particular cell type by following,
e.g., the teaching of Lasko et al., Proc. Natl. Acad. Sci. USA 89: 6232-6236 (1992).
The regulatory sequences required for such a cell-type specific activation will depend
upon the particular cell type of interest, and will be apparent to those of skill in the art.

Once transgenic animals have been generated, the expression of the recombinant
gene may be assayed utilizing standard techniques. Initial screening may be accomplished
by Southern blot analysis or PCR techniques to analyze animal tissues to verify that
integration of the transgene has taken place. The level of mRNA expression of the
transgene in the tissues of the transgenic animals may also be assessed using techniques
which include, but are not limited to, Northern blot analysis of tissue samples obtained
from the animal, in situ hybridization analysis, and reverse transcriptase-PCR (RT-PCR).
Samples of transgenic gene-expressing tissue may also be evaluated
immunocytochemically or immunohistochemically using antibodies specific for the
transgene product.

Once the founder animals are produced, they may be bred, inbred, outbred, or
crossbred to produce colonies of the particular animal. Examples of such breeding
strategies include, but are not limited to: outbreeding of founder animals with more than
one integration site in order to establish separate lines; inbreeding of separate lines in
order to produce compound transgenics that express the transgene at higher levels because
of the effects of additive expression of each transgene; crossing of heterozygous
transgenic animals to produce animals homozygous for a given integration site in order to
both augment expression and eliminate the need for screening of animals by DNA
analysis; crossing of separate homozygous lines to produce compound heterozygous or
homozygous lines; and breeding to place the transgene on a distinct background that is
appropriate for an experimental model of interest.

Transgenic animals of the invention have uses which include, but are not limited
to, animal model systems useful in elaborating the biological function of polypeptides of
the present invention, studying conditions and/or disorders associated with aberrant
expression, and in screening for compounds effective in ameliorating such conditions
and/or disorders.

Methods for creating a transgenic animal with a disruption of a targeted gene are
also well known in the art. In general, a vector is designed to comprise some nucleotide
sequences homologous to the endogenous targeted gene. The vector is introduced into a
cell so that it may integrate, via homologous recombination with chromosomal sequences,
into the endogenous gene, thereby disrupting the function of the endogenous gene. The
transgene may also be selectively introduced into a particular cell type, thus inactivating
the endogenous gene in only that cell type. See, e.g., Gu et al., Science 265: 103-106
(1994). The regulatory sequences required for such a cell-type specific inactivation will
depend upon the particular cell type of interest, and will be apparent to those of skill in the
art. See, e.g., Smithies et al., Nature 317: 230-234 (1985); Thomas et al., Cell 51: 503-512
(1987); Thompson et al., Cell 56: 313-321 (1989).
In one embodiment, a mutant, non-functional nucleic acid molecule of the
invention (or a completely unrelated DNA sequence) flanked by DNA homologous to the
endogenous nucleic acid sequence (either the coding regions or regulatory regions of the
gene) can be used, with or without a selectable marker and/or a negative selectable
marker, to transfect cells that express polypeptides of the invention in vivo. In another
embodiment, techniques known in the art are used to generate knockouts in cells that
contain, but do not express the gene of interest. Insertion of the DNA construct, via
targeted homologous recombination, results in inactivation of the targeted gene. Such
approaches are particularly suited in research and agricultural fields where modifications
to embryonic stem cells can be used to generate animal offspring with an inactive targeted
gene. See, e.g., Thomas, supra and Thompson, supra. However, this approach can be
routinely adapted for use in humans provided the recombinant DNA constructs are directly
administered or targeted to the required site in vivo using appropriate viral vectors that
will be apparent to those of skill in the art.

In further embodiments of the invention, cells that are genetically engineered to
express the polypeptides of the invention, or alternatively, that are genetically engineered
not to express the polypeptides of the invention (e.g., knockouts) are administered to a
patient in vivo. Such cells may be obtained from an animal or patient or an MHC
compatible donor and can include, but are not limited to, fibroblasts, bone marrow cells,
blood cells (e.g., lymphocytes), adipocytes, muscle cells, endothelial cells, etc. The cells
are genetically engineered in vitro using recombinant DNA techniques to introduce the
coding sequence of polypeptides of the invention into the cells, or alternatively, to disrupt
the coding sequence and/or endogenous regulatory sequence associated with the
polypeptides of the invention, e.g., by transduction (using viral vectors, and preferably
vectors that integrate the transgene into the cell genome) or transfection procedures,
including, but not limited to, the use of plasmids, cosmids, YACs, naked DNA,
electroporation, liposomes, etc.

The coding sequence of the polypeptides of the invention can be placed under the
control of a strong constitutive or inducible promoter or promoter/enhancer to achieve
expression, and preferably secretion, of the polypeptides of the invention. The engineered
cells which express and preferably secrete the polypeptides of the invention can be
introduced into the patient systemically, e.g., in the circulation, or intraperitoneally.
Alternatively, the cells can be incorporated into a matrix and implanted in the
body, e.g., genetically engineered fibroblasts can be implanted as part of a skin graft;
genetically engineered endothelial cells can be implanted as part of a lymphatic or
vascular graft. See, e.g., U.S. Patent Nos. 5,399,349 and 5,460,959, each of which is
incorporated by reference herein in its entirety.

When the cells to be administered are non-autologous or non-MHC compatible
cells, they can be administered using well known techniques which prevent the
development of a host immune response against the introduced cells. For example, the
cells may be introduced in an encapsulated form which, while allowing for an exchange of
components with the immediate extracellular environment, does not allow the introduced
cells to be recognized by the host immune system.

Transgenic and "knock-out" animals of the invention have uses which include, but
are not limited to, animal model systems useful in elaborating the biological function of
polypeptides of the present invention, studying conditions and/or disorders associated with
aberrant expression, and in screening for compounds effective in ameliorating such
conditions and/or disorders.

Computer Readable Means 

A further aspect of the invention is a computer readable means for storing the
nucleic acid and amino acid sequences of the instant invention. In a preferred
embodiment, the invention provides a computer readable means for storing SEQ ID NO:
113-259 and SEQ ID NO: 1-112 as described herein, as the complete set of sequences or
in any combination. The records of the computer readable means can be accessed for
reading and display and for interface with a computer system for the application of
programs allowing for the location of data upon a query for data meeting certain criteria,
the comparison of sequences, the alignment or ordering of sequences meeting a set of
criteria, and the like.
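
By way of illustration only, the location of stored records upon a query for data meeting
certain criteria might be sketched in Python as follows. The record layout, the field
names, the function `query_records` and the toy sequences are hypothetical assumptions
made for this sketch; none of them is taken from the sequence listing.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SequenceRecord:
    """One stored sequence; all fields are hypothetical, for illustration."""
    record_id: str   # placeholder label, not an actual SEQ ID NO
    kind: str        # "nucleic_acid" or "amino_acid"
    sequence: str


def query_records(records, kind=None, min_length=0, motif=None):
    """Return records meeting every supplied criterion, longest first."""
    hits = []
    for rec in records:
        if kind is not None and rec.kind != kind:
            continue
        if len(rec.sequence) < min_length:
            continue
        if motif is not None and motif not in rec.sequence:
            continue
        hits.append(rec)
    # Ordering step: sort the matching records by sequence length.
    return sorted(hits, key=lambda r: len(r.sequence), reverse=True)


# Toy records, invented for this example only.
toy_records = [
    SequenceRecord("rec_a", "nucleic_acid", "ATGGCCATTGTAA"),
    SequenceRecord("rec_b", "nucleic_acid", "ATGAAA"),
    SequenceRecord("rec_c", "amino_acid", "MAIVK"),
]
nucleic_hits = query_records(toy_records, kind="nucleic_acid", motif="GCC")
```

In this sketch each criterion is optional, so the same function serves both a narrow
query and a simple listing of all records of one type.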

The nucleic acid and amino acid sequences of the invention are particularly useful
as components in databases useful for search analyses as well as in sequence analysis
algorithms. As used herein, the terms "nucleic acid sequences of the invention" and
"amino acid sequences of the invention" mean any detectable chemical or physical
characteristic of a polynucleotide or polypeptide of the invention that is or may be reduced
to or stored in a computer readable form. These include, without limitation,




chromatographic scan data or peak data, photographic data or scan data therefrom, and
mass spectrographic data.

This invention provides computer readable media having stored thereon sequences
of the invention. A computer readable medium may comprise one or more of the
following: a nucleic acid sequence comprising a sequence of a nucleic acid sequence of
the invention; an amino acid sequence comprising an amino acid sequence of the
invention; a set of nucleic acid sequences wherein at least one of said sequences comprises
the sequence of a nucleic acid sequence of the invention; a set of amino acid sequences
wherein at least one of said sequences comprises the sequence of an amino acid sequence
of the invention; a data set representing a nucleic acid sequence comprising the sequence
of one or more nucleic acid sequences of the invention; a data set representing a nucleic
acid sequence encoding an amino acid sequence comprising the sequence of an amino acid
sequence of the invention; a set of nucleic acid sequences wherein at least one of said
sequences comprises the sequence of a nucleic acid sequence of the invention; a set of
amino acid sequences wherein at least one of said sequences comprises the sequence of an
amino acid sequence of the invention; a data set representing a nucleic acid sequence
comprising the sequence of a nucleic acid sequence of the invention; and a data set
representing a nucleic acid sequence encoding an amino acid sequence comprising the
sequence of an amino acid sequence of the invention. The computer readable medium can
be any composition of matter used to store information or data, including, for example,
commercially available floppy disks, tapes, hard drives, compact disks, and video disks.

Also provided by the invention are methods for the analysis of character
sequences, particularly genetic sequences. Preferred methods of sequence analysis
include, for example, methods of sequence homology analysis, such as identity and
similarity analysis, RNA structure analysis, sequence assembly, cladistic analysis,
sequence motif analysis, open reading frame determination, nucleic acid base calling, and
sequencing chromatogram peak analysis.
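
As one example of the analyses listed above, open reading frame determination might be
sketched in Python as follows. This is a minimal sketch under stated assumptions: it scans
only the three forward-strand frames, treats ATG as the sole start codon and uses the
standard stop codons. The function name `find_orfs`, the `min_codons` parameter and the
toy sequence are hypothetical and are not taken from the sequence listing.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(dna, min_codons=2):
    """Find ATG...stop open reading frames on the forward strand.

    Returns (start, end, frame) tuples; `end` is exclusive and the
    reported span includes the stop codon. `min_codons` counts the
    coding codons before the stop codon.
    """
    dna = dna.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == "ATG":
                start = i                      # open a candidate ORF
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3, frame))
                start = None                   # close it and keep scanning
    return orfs


# Toy sequence invented for this example.
toy_dna = "CCATGGCCTAAGG"
toy_orfs = find_orfs(toy_dna)
```

A fuller implementation would also scan the reverse complement and might allow
alternative start codons; those extensions are omitted here for brevity.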

A computer-based method is provided for performing nucleic acid sequence
identity or similarity identification. This method comprises the steps of providing a
nucleic acid sequence comprising the sequence of a nucleic acid of the invention in a
computer readable medium; and comparing said nucleic acid sequence to at least one
nucleic acid or amino acid sequence to identify sequence identity or similarity.
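
The comparing step might be sketched in Python as follows. This is a simplified
illustration, not the method of the invention: it assumes the sequences have already been
aligned to equal length (with '-' marking gaps) and scores identity only over ungapped
columns. The function names and the toy records are hypothetical and are not taken from
the sequence listing.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity between two pre-aligned, equal-length sequences.

    Columns in which either sequence has a gap ('-') are not counted.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue
        compared += 1
        if a == b:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


def best_match(query, records):
    """Return (name, percent identity) of the stored record closest to query."""
    scored = {name: percent_identity(query, seq) for name, seq in records.items()}
    best = max(scored, key=scored.get)
    return best, scored[best]


# Toy records invented for this example.
toy_db = {"rec_1": "ATGGCCATTG", "rec_2": "ATGGTCTTTA"}
best_name, best_identity = best_match("ATGGCCATTA", toy_db)
```

In practice the alignment itself would be produced by an established alignment program;
the sketch covers only the scoring of an existing alignment.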




A computer-based method is also provided for performing amino acid homology
identification, said method comprising the steps of: providing an amino acid sequence
comprising the sequence of an amino acid of the invention in a computer readable
medium; and comparing said amino acid sequence to at least one nucleic acid or an amino
acid sequence to identify homology.

A computer-based method is still further provided for assembly of overlapping
nucleic acid sequences into a single nucleic acid sequence, said method comprising the
steps of: providing a first nucleic acid sequence comprising the sequence of a nucleic acid
of the invention in a computer readable medium; and screening for at least one
overlapping region between said first nucleic acid sequence and a second nucleic acid
sequence. In addition, the invention includes a method of using patterns of expression
associated with either the nucleic acids or proteins in a computer-based method to
diagnose disease.
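
The screening step of the assembly method might be sketched in Python as follows. The
sketch assumes exact (mismatch-free) overlaps between the end of the first sequence and
the beginning of the second; the function names, the `min_len` parameter and the toy
sequences are hypothetical and are not taken from the sequence listing.

```python
def suffix_prefix_overlap(first, second, min_len=3):
    """Length of the longest suffix of `first` that equals a prefix of `second`.

    Returns 0 when no exact overlap of at least `min_len` bases exists.
    """
    if min_len < 1:
        raise ValueError("min_len must be at least 1")
    longest_possible = min(len(first), len(second))
    for length in range(longest_possible, min_len - 1, -1):
        if first[-length:] == second[:length]:
            return length
    return 0


def merge_overlapping(first, second, min_len=3):
    """Join two sequences at their overlap, or return None if there is none."""
    overlap = suffix_prefix_overlap(first, second, min_len)
    if overlap == 0:
        return None
    return first + second[overlap:]


# Toy fragments invented for this example; they share the bases "CATT".
merged = merge_overlapping("ATGGCCATT", "CATTGGA")
```

Real assembly software tolerates sequencing errors and considers both strands; the exact
match used here is a deliberate simplification.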

Diagnostic Methods for Colon Cancer 

The present invention also relates to quantitative and qualitative diagnostic assays
and methods for detecting, diagnosing, monitoring, staging and predicting cancers by
comparing expression of a CSNA or a CSP in a human patient that has or may have colon
cancer, or who is at risk of developing colon cancer, with the expression of a CSNA or a
CSP in a normal human control. For purposes of the present invention, "expression of a
CSNA" or "CSNA expression" means the quantity of CSNA mRNA that can be measured
by any method known in the art or the level of transcription that can be measured by any
method known in the art in a cell, tissue, organ or whole patient. Similarly, the term
"expression of a CSP" or "CSP expression" means the amount of CSP that can be
measured by any method known in the art or the level of translation of a CSNA that can be
measured by any method known in the art.

The present invention provides methods for diagnosing colon cancer in a patient,
in particular adenocarcinoma, by analyzing for changes in levels of CSNA or CSP in cells,
tissues, organs or bodily fluids compared with levels of CSNA or CSP in cells, tissues,
organs or bodily fluids of preferably the same type from a normal human control, wherein
an increase, or decrease in certain cases, in levels of a CSNA or CSP in the patient versus
the normal human control is associated with the presence of colon cancer or with a
predilection to the disease. In another preferred embodiment, the present invention
provides methods for diagnosing colon cancer in a patient by analyzing changes in the
structure of the mRNA of a CSG compared to the mRNA from a normal control. These
changes include, without limitation, aberrant splicing, alterations in polyadenylation
and/or alterations in 5' nucleotide capping. In yet another preferred embodiment, the
present invention provides methods for diagnosing colon cancer in a patient by analyzing
changes in a CSP compared to a CSP from a normal patient. These changes include, e.g.,
alterations, including post translational modifications such as glycosylation and/or
phosphorylation of the CSP or changes in the subcellular CSP localization.

For purposes of the present invention, diagnosing means that CSNA or CSP levels
are used to determine the presence or absence of disease in a patient. As will be
understood by those of skill in the art, measurement of other diagnostic parameters may be
required for definitive diagnosis or determination of the appropriate treatment for the
disease. The determination may be made by a clinician, a doctor, a testing laboratory, or a
patient using an over the counter test. The patient may have symptoms of disease or may
be asymptomatic. In addition, the CSNA or CSP levels of the present invention may be
used as a screening marker to determine whether further tests or biopsies are warranted. In
addition, the CSNA or CSP levels may be used to determine the vulnerability or
susceptibility to disease.

In a preferred embodiment, the expression of a CSNA is measured by determining
the amount of a mRNA that encodes an amino acid sequence selected from SEQ ID NO:
113-259, a homolog, an allelic variant, or a fragment thereof. In a more preferred
embodiment, the CSNA expression that is measured is the level of expression of a CSNA
mRNA selected from SEQ ID NO: 1-112, or a hybridizing nucleic acid, homologous
nucleic acid or allelic variant thereof, or a part of any of these nucleic acid molecules.

CSNA expression may be measured by any method known in the art, such as those
described supra, including measuring mRNA expression by Northern blot, quantitative or
qualitative reverse transcriptase PCR (RT-PCR), microarray, dot or slot blots or in situ
hybridization. See, e.g., Ausubel (1992), supra; Ausubel (1999), supra; Sambrook
(1989), supra; and Sambrook (2001), supra. CSNA transcription may be measured by any
method known in the art, including using a reporter gene operably linked to the promoter
of a CSG of interest or performing nuclear run-off assays. Alterations in mRNA structure,
e.g., aberrant splicing variants, may be determined by any method known in the art,
including RT-PCR followed by sequencing or restriction analysis. As necessary, CSNA
expression may be compared to a known control, such as normal colon nucleic acid, to
detect a change in expression.

In another preferred embodiment, the expression of a CSP is measured by
determining the level of a CSP having an amino acid sequence selected from the group
consisting of SEQ ID NO: 113-259, a homolog, an allelic variant, or a fragment thereof.
Such levels are preferably determined in at least one of cells, tissues, organs and/or bodily
fluids, including determination of normal and abnormal levels. Thus, for instance, a
diagnostic assay in accordance with the invention for diagnosing over- or underexpression
of a CSNA or CSP compared to normal control bodily fluids, cells, or tissue samples may
be used to diagnose the presence of colon cancer. The expression level of a CSP may be
determined by any method known in the art, such as those described supra. In a preferred
embodiment, the CSP expression level may be determined by radioimmunoassays,
competitive-binding assays, ELISA, Western blot, FACS, immunohistochemistry,
immunoprecipitation, or proteomic approaches, including two-dimensional gel
electrophoresis (2D electrophoresis) and non-gel-based approaches such as mass
spectrometry or protein interaction profiling. See, e.g., Harlow (1999), supra; Ausubel
(1992), supra; and Ausubel (1999), supra. Alterations in the CSP structure may be
determined by any method known in the art, including, e.g., using antibodies that
specifically recognize phosphoserine, phosphothreonine or phosphotyrosine residues,
two-dimensional polyacrylamide gel electrophoresis (2D PAGE) and/or chemical analysis
of amino acid residues of the protein. Id.

In a preferred embodiment, a radioimmunoassay (RIA) or an ELISA is used. An
antibody specific to a CSP is prepared if one is not already available. In a preferred
embodiment, the antibody is a monoclonal antibody. The anti-CSP antibody is bound to a
solid support and any free protein binding sites on the solid support are blocked with a
protein such as bovine serum albumin. A sample of interest is incubated with the antibody
on the solid support under conditions in which the CSP will bind to the anti-CSP antibody.
The sample is removed, the solid support is washed to remove unbound material, and an
anti-CSP antibody that is linked to a detectable reagent (a radioactive substance for RIA
and an enzyme for ELISA) is added to the solid support and incubated under conditions in
which binding of the CSP to the labeled antibody will occur. After binding, the unbound
labeled antibody is removed by washing. For an ELISA, one or more substrates are added
to produce a colored reaction product that is based upon the amount of a CSP in the
sample. For an RIA, the solid support is counted for radioactive decay signals by any
method known in the art. Quantitative results for both RIA and ELISA typically are
obtained by reference to a standard curve.
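
By way of illustration only, reading a sample against a standard curve might be sketched as follows. The function name `concentration_from_signal`, the use of piecewise-linear interpolation, and the assumption that signal rises monotonically with concentration are all choices made for this example; the specification does not prescribe a curve-fitting method, and many laboratories fit a four-parameter logistic curve instead.

```python
def concentration_from_signal(signal, standards):
    """Interpolate an analyte concentration from a standard curve.

    standards: list of (concentration, signal) pairs measured for
    calibrators of known concentration. Signal is assumed to increase
    monotonically with concentration (an assumption of this sketch).
    """
    points = sorted(standards, key=lambda p: p[1])
    if not points[0][1] <= signal <= points[-1][1]:
        raise ValueError("signal outside the range of the standard curve")
    # Walk adjacent calibrator pairs and interpolate linearly between
    # the two that bracket the measured signal.
    for (c_lo, s_lo), (c_hi, s_hi) in zip(points, points[1:]):
        if s_lo <= signal <= s_hi:
            if s_hi == s_lo:
                return c_lo
            fraction = (signal - s_lo) / (s_hi - s_lo)
            return c_lo + fraction * (c_hi - c_lo)
    # Reached only when a single calibrator was supplied.
    return points[0][0]
```

Signals outside the calibrated range raise an error rather than being extrapolated, since a standard curve is only informative between its lowest and highest calibrators.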

Other methods to measure CSP levels are known in the art. For instance, a
competition assay may be employed wherein an anti-CSP antibody is attached to a solid
support and an allocated amount of a labeled CSP and a sample of interest are incubated
with the solid support. The amount of labeled CSP attached to the solid support can be
correlated to the quantity of a CSP in the sample.
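
By way of illustration only, the inverse relationship exploited by a competition assay might be sketched as follows. The sketch rests on an idealized model that is an assumption of this example and not a statement from the specification: antibody binding sites are limiting, labeled and unlabeled CSP bind with equal affinity, and the bound labeled signal therefore falls as B/B0 = L/(L + U), where L is the labeled amount, U is the unlabeled CSP in the sample, and B0 is the signal with no sample present. The function name `unlabeled_from_competition` is hypothetical.

```python
def unlabeled_from_competition(bound_signal, max_signal, labeled_amount):
    """Estimate unlabeled analyte from a competition assay reading.

    Idealized model (an assumption): B / B0 = L / (L + U), so
    U = L * (B0 / B - 1).
    bound_signal:   labeled signal on the support with sample present (B)
    max_signal:     labeled signal with no sample present (B0)
    labeled_amount: amount of labeled analyte added (L)
    """
    if not 0 < bound_signal <= max_signal:
        raise ValueError("bound_signal must be in (0, max_signal]")
    return labeled_amount * (max_signal / bound_signal - 1.0)
```

In practice the correlation is usually established empirically with calibrators rather than from a closed-form model.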

Of the proteomic approaches, 2D PAGE is a well known technique. Isolation of
individual proteins from a sample such as serum is accomplished using sequential
separation of proteins by isoelectric point and molecular weight. Typically, polypeptides
are first separated by isoelectric point (the first dimension) and then separated by size
using an electric current (the second dimension). In general, the second dimension is
perpendicular to the first dimension. Because no two proteins with different sequences are
identical on the basis of both size and charge, the result of 2D PAGE is a roughly square
gel in which each protein occupies a unique spot. Analysis of the spots with chemical or
antibody probes, or subsequent protein microsequencing can reveal the relative abundance
of a given protein and the identity of the proteins in the sample.

Expression levels of a CSNA can be determined by any method known in the art.
PCR and other nucleic acid methods, such as ligase chain reaction (LCR) and
nucleic acid sequence based amplification (NASBA), can be used to detect malignant cells
for diagnosis and monitoring of various malignancies. For example, reverse-transcriptase
PCR (RT-PCR) is a powerful technique which can be used to detect the presence of a
specific mRNA population in a complex mixture of thousands of other mRNA species. In
RT-PCR, an mRNA species is first reverse transcribed to complementary DNA (cDNA)
with use of the enzyme reverse transcriptase; the cDNA is then amplified as in a standard
PCR reaction.

Hybridization to specific DNA molecules (e.g., oligonucleotides) arrayed on a
solid support can be used to both detect the expression of and quantitate the level of
expression of one or more CSNAs of interest. In this approach, all or a portion of one or
more CSNAs is fixed to a substrate. A sample of interest, which may comprise RNA, e.g.,
total RNA or polyA-selected mRNA, or a complementary DNA (cDNA) copy of the RNA,
is incubated with the solid support under conditions in which hybridization will occur
between the DNA on the solid support and the nucleic acid molecules in the sample of
interest. Hybridization between the substrate-bound DNA and the nucleic acid molecules
in the sample can be detected and quantitated by several means, including, without
limitation, radioactive labeling or fluorescent labeling of the nucleic acid molecule or a
secondary molecule designed to detect the hybrid.

The above tests can be carried out on samples derived from a variety of cells,
bodily fluids and/or tissue extracts such as homogenates or solubilized tissue obtained
from a patient. Tissue extracts are obtained routinely from tissue biopsy and autopsy
material. Bodily fluids useful in the present invention include blood, urine, saliva or any
other bodily secretion or derivative thereof. As used herein "blood" includes whole blood,
plasma, serum, circulating epithelial cells, constituents, or any derivative of blood.

In addition to detection in bodily fluids, the proteins and nucleic acids of the
invention are suitable for detection by cell capture technology. Whole cells may be
captured by a variety of methods, for example magnetic separation, such as described in
U.S. Patent Nos. 5,200,084; 5,186,827; 5,108,933; and 4,925,788, the disclosures of which
are incorporated herein by reference in their entireties. Epithelial cells may be captured using
such products as Dynabeads® or CELLection™ (Dynal Biotech, Oslo, Norway).
Alternatively, fractions of blood may be captured, e.g., the buffy coat fraction (50mm cells
isolated from 5ml of blood) containing epithelial cells. In addition, cancer cells may be
captured using the techniques described in WO 00/47998, the disclosure of which is
incorporated herein by reference in its entirety. Once the cells are captured or
concentrated, the proteins or nucleic acids are detected by the means described in the
subject application. Alternatively, nucleic acids may be captured directly from blood
samples, see U.S. Patent Nos. 6,156,504; 5,501,963; or WO 01/42504, the disclosures of
which are incorporated herein by reference in their entireties.

In a preferred embodiment, the specimen tested for expression of CSNA or CSP
includes without limitation colon tissue, fecal samples, colonocytes, colon cells grown in
cell culture, blood, serum, lymph node tissue, and lymphatic fluid. In another preferred
embodiment, especially when metastasis of a primary colon cancer is known or suspected,
specimens include, without limitation, tissues from brain, bone, bone marrow, liver, lungs,
and adrenal glands. In general, the tissues may be sampled by biopsy, including, without
limitation, needle biopsy, e.g., transthoracic needle aspiration, cervical mediastinoscopy,
endoscopic lymph node biopsy, video-assisted thoracoscopy, exploratory thoracotomy,
bone marrow biopsy and bone marrow aspiration.

Colonocytes represent an important source of the CSP or CSNA because they
provide a picture of the immediate past metabolic history of the GI tract of a subject. In
addition, such cells are representative of the cell population from a statistically large
sampling frame reflecting the state of the colonic mucosa along the entire length of the
colon in a non-invasive manner, in contrast to a limited sampling by colonic biopsy using
an invasive procedure involving endoscopy. Specific examples of patents describing the
isolation of colonocytes include U.S. Patent Nos. 6,335,193; 6,020,137; 5,741,650;
6,258,541; US 2001/0026925 A1; and WO 00/63358 A1, the disclosures of which are
incorporated herein by reference in their entireties.

All the methods of the present invention may optionally include determining the
expression levels of one or more other cancer markers in addition to determining the
expression level of a CSNA or CSP. In many cases, the use of another cancer marker
will decrease the likelihood of false positives or false negatives. In one embodiment, the
one or more other cancer markers include other CSNAs or CSPs as disclosed herein. Other
cancer markers useful in the present invention will depend on the cancer being tested and
are known to those of skill in the art. In a preferred embodiment, at least one other cancer
marker in addition to a particular CSNA or CSP is measured. In a more preferred
embodiment, at least two other additional cancer markers are used. In an even more
preferred embodiment, at least three, more preferably at least five, even more preferably at
least ten additional cancer markers are used.

Diagnosing 

In one aspect, the invention provides a method for determining the expression
levels and/or structural alterations of one or more CSNA and/or CSP in a sample from a
patient suspected of having colon cancer. In general, the method comprises the steps of
obtaining the sample from the patient, determining the expression level or structural
alterations of a CSNA and/or CSP and then ascertaining whether the patient has colon
cancer from the expression level of the CSNA or CSP. In general, if high expression
relative to a control of a CSNA or CSP is indicative of colon cancer, a diagnostic assay is
considered positive if the level of expression of the CSNA or CSP is at least one and a half
times higher, more preferably at least two times higher, still more preferably five
times higher, even more preferably at least ten times higher, than in preferably the same
cells, tissues or bodily fluid of a normal human control. In contrast, if low expression
relative to a control of a CSNA or CSP is indicative of colon cancer, a diagnostic assay is
considered positive if the level of expression of the CSNA or CSP is at least one and a half
times lower, more preferably at least two times lower, still more preferably five
times lower, even more preferably at least ten times lower than in preferably the same
cells, tissues or bodily fluid of a normal human control. The normal human control may
be from a different patient or from uninvolved tissue of the same patient.
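
By way of illustration only, the fold-change criteria described above might be sketched as follows. The names `fold_change`, `classify_assay` and `THRESHOLDS` are assumptions introduced for this example, as is the reading of "five times" as "at least five times"; the threshold values themselves (1.5, 2, 5 and 10) are taken from the preceding paragraph.

```python
# Fold thresholds from the text, most stringent first.
THRESHOLDS = (10.0, 5.0, 2.0, 1.5)


def fold_change(patient_level, control_level, direction="high"):
    """Fold difference between patient and control expression.

    direction="high": marker is elevated in disease (patient / control).
    direction="low":  marker is reduced in disease (control / patient).
    """
    if patient_level <= 0 or control_level <= 0:
        raise ValueError("expression levels must be positive")
    if direction == "high":
        return patient_level / control_level
    if direction == "low":
        return control_level / patient_level
    raise ValueError("direction must be 'high' or 'low'")


def classify_assay(patient_level, control_level, direction="high"):
    """Return (is_positive, most stringent threshold met or None)."""
    change = fold_change(patient_level, control_level, direction)
    for threshold in THRESHOLDS:
        if change >= threshold:
            return True, threshold
    return False, None
```

For example, a patient level of 30 against a control level of 10 for a marker that is elevated in disease is a three-fold change, which meets the two-fold criterion but not the five-fold one.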

The present invention also provides a method of determining whether colon cancer
has metastasized in a patient. One may identify whether the colon cancer has metastasized
by measuring the expression levels and/or structural alterations of one or more CSNAs
and/or CSPs in a variety of tissues. The presence of a CSNA or CSP in a tissue other than
colon at levels higher than that of corresponding noncancerous tissue (e.g., the same tissue
from another individual) is indicative of metastasis if high level expression of a CSNA or
CSP is associated with colon cancer. Similarly, the presence of a CSNA or CSP in a
tissue other than colon at levels lower than that of corresponding noncancerous tissue is
indicative of metastasis if low level expression of a CSNA or CSP is associated with colon
cancer. Further, the presence of a structurally altered CSNA or CSP that is associated
with colon cancer is also indicative of metastasis.

In general, if high expression relative to a control of a CSNA or CSP is indicative
of metastasis, an assay for metastasis is considered positive if the level of expression of
the CSNA or CSP is at least one and a half times higher, more preferably at least
two times higher, still more preferably five times higher, even more preferably at least ten
times higher, than in preferably the same cells, tissues or bodily fluid of a normal human
control. In contrast, if low expression relative to a control of a CSNA or CSP is indicative
of metastasis, an assay for metastasis is considered positive if the level of expression of
the CSNA or CSP is at least one and a half times lower, more preferably at least
two times lower, still more preferably five times lower, even more preferably at least ten
times lower than in preferably the same cells, tissues or bodily fluid of a normal human
control.

The invention also provides a method of staging colon cancer in a human patient.
The method comprises identifying a human patient having colon cancer and analyzing
cells, tissues or bodily fluids from such human patient for expression levels and/or
structural alterations of one or more CSNAs or CSPs. First, one or more tumors from a
variety of patients are staged according to procedures well known in the art, and the
expression levels of one or more CSNAs or CSPs are determined for each stage to obtain a
standard expression level for each CSNA and CSP. Then, the expression levels of the
CSNA or CSP are determined in a biological sample from a patient whose
stage of cancer is not known. The CSNA or CSP expression levels from the patient are
then compared to the standard expression level. By comparing the expression level of the
CSNAs and CSPs from the patient to the standard expression levels, one may determine
the stage of the tumor. The same procedure may be followed using structural alterations
of a CSNA or CSP to determine the stage of a colon cancer.
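
By way of illustration only, the comparison of a patient's expression level to per-stage standard expression levels might be sketched as follows. The function name `closest_stage`, the stage labels, the numeric standards in the usage note, and the choice of a log-ratio distance are all assumptions of this example; the specification does not state how the comparison is to be scored.

```python
import math


def closest_stage(patient_level, stage_standards):
    """Return the stage whose standard level is nearest the patient's.

    stage_standards: mapping of stage label -> standard expression level
    obtained beforehand from tumors of known stage.
    Distance is the absolute log ratio, so a two-fold excess and a
    two-fold deficit count as equally far (an assumption of this sketch).
    """
    if patient_level <= 0:
        raise ValueError("expression level must be positive")
    if any(level <= 0 for level in stage_standards.values()):
        raise ValueError("standard levels must be positive")
    return min(
        stage_standards,
        key=lambda stage: abs(math.log(patient_level / stage_standards[stage])),
    )
```

With hypothetical standards of 2, 4, 8 and 16 units for stages labeled I through IV, a patient level of 5 units is assigned to stage II. These numbers are invented for the example and are not measured data.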

Monitoring 

Further provided is a method of monitoring colon cancer in a human patient. One
may monitor a human patient to determine whether there has been metastasis and, if there
has been, when metastasis began to occur. One may also monitor a human patient to
determine whether a preneoplastic lesion has become cancerous. One may also monitor a
human patient to determine whether a therapy, e.g., chemotherapy, radiotherapy or
surgery, has decreased or eliminated the colon cancer. The monitoring may determine if
there has been a recurrence and, if so, determine its nature. The method comprises
identifying a human patient that one wants to monitor for colon cancer, periodically
analyzing cells, tissues or bodily fluids from such human patient for expression levels of
one or more CSNAs or CSPs, and comparing the CSNA or CSP levels over time to those
CSNA or CSP expression levels obtained previously. Patients may also be monitored by
measuring one or more structural alterations in a CSNA or CSP that are associated with
colon cancer.

If increased expression of a CSNA or CSP is associated with metastasis, treatment
failure, or conversion of a preneoplastic lesion to a cancerous lesion, then detecting an
increase in the expression level of a CSNA or CSP indicates that the tumor is
metastasizing, that treatment has failed or that the lesion is cancerous, respectively. One
having ordinary skill in the art would recognize that if this were the case, then a decreased
expression level would be indicative of no metastasis, effective therapy or failure to
progress to a neoplastic lesion. If decreased expression of a CSNA or CSP is associated
with metastasis, treatment failure, or conversion of a preneoplastic lesion to a cancerous
lesion, then detecting a decrease in the expression level of a CSNA or CSP indicates that
the tumor is metastasizing, that treatment has failed or that the lesion is cancerous,
respectively. In a preferred embodiment, the levels of CSNAs or CSPs are determined
from the same cell type, tissue or bodily fluid as prior patient samples. Monitoring a
patient for onset of colon cancer metastasis is periodic and preferably is done on a
quarterly basis, but may be done more or less frequently.
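
By way of illustration only, the comparison of periodic measurements with previously obtained levels might be sketched as follows. The function name `monitor_levels`, the status labels, and the `min_fold` cutoff of 1.5 are assumptions of this example; the specification does not state a numerical threshold for monitoring.

```python
def monitor_levels(samples, associated_with="increase", min_fold=1.5):
    """Compare each sample with the one obtained before it.

    samples: chronologically ordered list of (label, expression_level).
    associated_with: "increase" if rising expression accompanies
        metastasis or treatment failure, "decrease" if falling does.
    min_fold: hypothetical cutoff below which a change is ignored.
    Returns a list of (label, status) for every sample after the first.
    """
    if associated_with not in ("increase", "decrease"):
        raise ValueError("associated_with must be 'increase' or 'decrease'")
    findings = []
    for (_, previous), (label, current) in zip(samples, samples[1:]):
        if previous <= 0 or current <= 0:
            raise ValueError("expression levels must be positive")
        ratio = current / previous
        if associated_with == "decrease":
            ratio = 1.0 / ratio  # a fall now counts as the adverse direction
        if ratio >= min_fold:
            status = "adverse change"
        elif ratio <= 1.0 / min_fold:
            status = "favorable change"
        else:
            status = "no clear change"
        findings.append((label, status))
    return findings
```

Each sample is compared only with its immediate predecessor; comparing against a pretreatment baseline instead would be an equally reasonable design.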

The methods described herein can further be utilized as prognostic assays to
identify subjects having or at risk of developing a disease or disorder associated with
increased or decreased expression levels of a CSNA and/or CSP. The present invention
provides a method in which a test sample is obtained from a human patient and one or
more CSNAs and/or CSPs are detected. The presence of higher (or lower) CSNA or CSP
levels as compared to normal human controls is diagnostic for the human patient being at
risk for developing cancer, particularly colon cancer. The effectiveness of therapeutic
agents to decrease (or increase) expression or activity of one or more CSNAs and/or CSPs
of the invention can also be monitored by analyzing levels of expression of the CSNAs
and/or CSPs in a human patient in clinical trials or in in vitro screening assays such as in
human cells. In this way, the gene expression pattern can serve as a marker, indicative of
the physiological response of the human patient or cells, as the case may be, to the agent
being tested.

Detection of Genetic Lesions or Mutations 

The methods of the present invention can also be used to detect genetic lesions or
mutations in a CSG, thereby determining if a human with the genetic lesion is susceptible
to developing colon cancer or to determine what genetic lesions are responsible, or are
partly responsible, for a person's existing colon cancer. Genetic lesions can be detected,
for example, by ascertaining the existence of a deletion, insertion and/or substitution of
one or more nucleotides from the CSGs of this invention, a chromosomal rearrangement
of a CSG, an aberrant modification of a CSG (such as of the methylation pattern of the
genomic DNA), or allelic loss of a CSG. Methods to detect such lesions in the CSG of
this invention are known to those having ordinary skill in the art following the teachings
of the specification.

Methods of Detecting Noncancerous Colon Diseases 

The present invention also provides methods for determining the expression levels
and/or structural alterations of one or more CSNAs and/or CSPs in a sample from a patient
suspected of having or known to have a noncancerous colon disease. In general, the
method comprises the steps of obtaining a sample from the patient, determining the
expression level or structural alterations of a CSNA and/or CSP, comparing the expression
level or structural alteration of the CSNA or CSP to a normal colon control, and then
ascertaining whether the patient has a noncancerous colon disease. In general, if high
expression relative to a control of a CSNA or CSP is indicative of a particular
noncancerous colon disease, a diagnostic assay is considered positive if the level of
expression of the CSNA or CSP is at least two times higher, more preferably at
least five times higher, even more preferably at least ten times higher, than in preferably
the same cells, tissues or bodily fluid of a normal human control. In contrast, if low
expression relative to a control of a CSNA or CSP is indicative of a noncancerous colon
disease, a diagnostic assay is considered positive if the level of expression of the CSNA or
CSP is at least two times lower, more preferably at least five times lower, even more
preferably at least ten times lower than in preferably the same cells, tissues or bodily fluid
of a normal human control. The normal human control may be from a different patient or
from uninvolved tissue of the same patient.

One having ordinary skill in the art may determine whether a CSNA and/or CSP is
associated with a particular noncancerous colon disease by obtaining colon tissue from a
patient having a noncancerous colon disease of interest and determining which CSNAs
and/or CSPs are expressed in the tissue at either a higher or a lower level than in normal
colon tissue. In another embodiment, one may determine whether a CSNA or CSP
exhibits structural alterations in a particular noncancerous colon disease state by obtaining
colon tissue from a patient having a noncancerous colon disease of interest and
determining the structural alterations in one or more CSNAs and/or CSPs relative to
normal colon tissue.

Methods for Identifying Colon Tissue

In another aspect, the invention provides methods for identifying colon tissue.
These methods are particularly useful in, e.g., forensic science, colon cell differentiation
and development, and in tissue engineering.

In one embodiment, the invention provides a method for determining whether a
sample is colon tissue or has colon tissue-like characteristics. The method comprises the
steps of providing a sample suspected of comprising colon tissue or having colon
tissue-like characteristics, determining whether the sample expresses one or more CSNAs
and/or CSPs, and, if the sample expresses one or more CSNAs and/or CSPs, concluding
that the sample comprises colon tissue. In a preferred embodiment, the CSNA encodes a
polypeptide having an amino acid sequence selected from SEQ ID NO: 113-259, or a
homolog, allelic variant or fragment thereof. In a more preferred embodiment, the CSNA
has a nucleotide sequence selected from SEQ ID NO: 1-112, or a hybridizing nucleic acid,
an allelic variant or a part thereof. Determining whether a sample expresses a CSNA can
be accomplished by any method known in the art. Preferred methods include
hybridization to microarrays, Northern blot hybridization, and quantitative or qualitative
RT-PCR. In another preferred embodiment, the method can be practiced by determining
whether a CSP is expressed. Determining whether a sample expresses a CSP can be
accomplished by any method known in the art. Preferred methods include Western blot,
ELISA, RIA and 2D PAGE. In one embodiment, the CSP has an amino acid sequence
selected from SEQ ID NO: 113-259, or a homolog, allelic variant or fragment thereof. In
another preferred embodiment, the expression of at least two CSNAs and/or CSPs is
determined. In a more preferred embodiment, the expression of at least three, more
preferably four and even more preferably five CSNAs and/or CSPs is determined.
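
By way of illustration only, the decision step of the identification method might be sketched as follows. The function name `looks_like_colon_tissue`, the marker labels in the usage note, and the `detection_limit` parameter are assumptions of this example; the requirement that a minimum number of CSNAs and/or CSPs be expressed follows the preferred embodiments described above.

```python
def looks_like_colon_tissue(measured, panel, min_markers=1, detection_limit=0.0):
    """Decide whether a sample expresses enough colon specific markers.

    measured: mapping of marker label -> measured expression level.
    panel:    marker labels (CSNAs and/or CSPs) to be checked.
    min_markers: how many panel markers must be expressed.
    detection_limit: level a marker must exceed to count as expressed
        (a hypothetical parameter; the real cutoff is assay-specific).
    Returns (decision, list of expressed panel markers).
    """
    expressed = [
        marker for marker in panel
        if measured.get(marker, 0.0) > detection_limit
    ]
    return len(expressed) >= min_markers, expressed
```

For instance, with a hypothetical four-marker panel in which two markers are detected, the sample satisfies a two-marker requirement but not a three-marker requirement. Markers absent from the measurements are treated as not expressed.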

In one embodiment, the method can be used to determine whether an unknown
tissue is colon tissue. This is particularly useful in forensic science, in which small,
damaged pieces of tissues that are not identifiable by microscopic or other means are
recovered from a crime or accident scene. In another embodiment, the method can be
used to determine whether a tissue is differentiating or developing into colon tissue. This
is important in monitoring the effects of the addition of various agents to cell or tissue
culture, e.g., in producing new colon tissue by tissue engineering. These agents include,
e.g., growth and differentiation factors, extracellular matrix proteins and culture medium.
Other factors that may be measured for effects on tissue development and differentiation
include gene transfer into the cells or tissues, alterations in pH, aqueous:air interface and
various other culture conditions.

Methods for Producing and Modifying Colon Tissue 

In another aspect, the invention provides methods for producing engineered colon
tissue or cells. In one embodiment, the method comprises the steps of providing cells,
introducing a CSNA or a CSG into the cells, and growing the cells under conditions in
which they exhibit one or more properties of colon tissue cells. In a preferred
embodiment, the cells are pluripotent. As is well known in the art, normal colon tissue
comprises a large number of different cell types. Thus, in one embodiment, the
engineered colon tissue or cells comprises one of these cell types. In another embodiment,
the engineered colon tissue or cells comprises more than one colon cell type. Further, the
culture conditions of the cells or tissue may require manipulation in order to achieve full
differentiation and development of the colon cell tissue. Methods for manipulating culture
conditions are well known in the art.

Nucleic acid molecules encoding one or more CSPs are introduced into cells,
preferably pluripotent cells. In a preferred embodiment, the nucleic acid molecules
encode CSPs having amino acid sequences selected from SEQ ID NO: 113-259, or
homologous proteins, analogs, allelic variants or fragments thereof. In a more preferred
embodiment, the nucleic acid molecules have a nucleotide sequence selected from SEQ ID
NO: 1-112, or hybridizing nucleic acids, allelic variants or parts thereof. In another highly
preferred embodiment, a CSG is introduced into the cells. Expression vectors and
methods of introducing nucleic acid molecules into cells are well known in the art and are
described in detail, supra.

Artificial colon tissue may be used to treat patients who have lost some or all of
their colon function.

Pharmaceutical Compositions 

In another aspect, the invention provides pharmaceutical compositions comprising
the nucleic acid molecules, polypeptides, fusion proteins, antibodies, antibody derivatives,
antibody fragments, agonists, antagonists, or inhibitors of the present invention. In a
preferred embodiment, the pharmaceutical composition comprises a CSNA or part thereof.
In a more preferred embodiment, the CSNA has a nucleotide sequence selected from the
group consisting of SEQ ID NO: 1-112, a nucleic acid that hybridizes thereto, an allelic
variant thereof, or a nucleic acid that has substantial sequence identity thereto. In another
preferred embodiment, the pharmaceutical composition comprises a CSP or fragment
thereof. In a more preferred embodiment, the pharmaceutical composition comprises a
CSP having an amino acid sequence that is selected from the group consisting of SEQ ID
NO: 113-259, a polypeptide that is homologous thereto, a fusion protein comprising all or
a portion of the polypeptide, or an analog or derivative thereof. In another preferred
embodiment, the pharmaceutical composition comprises an anti-CSP antibody, preferably
an antibody that specifically binds to a CSP having an amino acid sequence that is selected
from the group consisting of SEQ ID NO: 113-259, or an antibody that binds to a
polypeptide that is homologous thereto, a fusion protein comprising all or a portion of the
polypeptide, or an analog or derivative thereof.

Due to the association of angiogenesis with cancer vascularization there is great
need of new markers and methods for diagnosing angiogenesis activity to identify
developing tumors and angiogenesis related diseases. Furthermore, great need is also
present for new molecular targets useful in the treatment of angiogenesis and angiogenesis
related diseases such as cancer. In addition, known modulators of angiogenesis such as
endostatin or vascular endothelial growth factor (VEGF). Use of the methods and
compositions disclosed herein in combination with anti-angiogenesis drugs, drugs that
block the matrix breakdown (such as BMS-275291, Dalteparin (Fragmin®), Suramin),
drugs that inhibit endothelial cells (2-methoxyestradiol (2-ME), CC-5013 (Thalidomide
Analog), Combretastatin A4 Phosphate, LY317615 (Protein Kinase C Beta Inhibitor), Soy
Isoflavone (Genistein; Soy Protein Isolate), Thalidomide), drugs that block activators of
angiogenesis (AE-941 (Neovastat™; GW786034), Anti-VEGF Antibody (Bevacizumab;
Avastin™), Interferon-alpha, PTK787/ZK 222584, VEGF-Trap, ZD6474), drugs that
inhibit endothelial-specific integrin/survival signaling (EMD 121974, Anti-Anb3 Integrin
Antibody (Medi-522; Vitaxin™)).

Such a composition typically contains from about 0.1 to 90% by weight of a
therapeutic agent of the invention formulated in and/or with a pharmaceutically acceptable
carrier or excipient.

Pharmaceutical formulation is a well-established art that is further described in
Gennaro (ed.), Remington: The Science and Practice of Pharmacy, 20th ed., Lippincott,
Williams & Wilkins (2000); Ansel et al., Pharmaceutical Dosage Forms and Drug
Delivery Systems, 7th ed., Lippincott Williams & Wilkins (1999); and Kibbe (ed.),



WO 2004/050900 



PCT/US2003/040131 



118 

Handbook of Pharmaceutical Excipients, American Pharmaceutical Association, 3rd ed.
(2000), and thus need not be described in detail herein.

Briefly, formulation of the pharmaceutical compositions of the present invention
will depend upon the route chosen for administration. The pharmaceutical compositions
utilized in this invention can be administered by various routes including both enteral and
parenteral routes, including oral, intravenous, intramuscular, subcutaneous, inhalation,
topical, sublingual, rectal, intra-arterial, intramedullary, intrathecal, intraventricular,
transmucosal, transdermal, intranasal, intraperitoneal, intrapulmonary, and intrauterine.

Oral dosage forms can be formulated as tablets, pills, dragees, capsules, liquids,
gels, syrups, slurries, suspensions, and the like, for ingestion by the patient.

Solid formulations of the compositions for oral administration can contain suitable
carriers or excipients, such as carbohydrate or protein fillers, such as sugars, including
lactose, sucrose, mannitol, or sorbitol; starch from corn, wheat, rice, potato, or other
plants; cellulose, such as methyl cellulose, hydroxypropylmethyl-cellulose, sodium
carboxymethylcellulose, or microcrystalline cellulose; gums including arabic and
tragacanth; proteins such as gelatin and collagen; inorganics, such as kaolin, calcium
carbonate, dicalcium phosphate, sodium chloride; and other agents such as acacia and
alginic acid.

Agents that facilitate disintegration and/or solubilization can be added, such as
cross-linked polyvinyl pyrrolidone, agar, alginic acid or a salt thereof, such as sodium
alginate, microcrystalline cellulose, cornstarch, and sodium starch glycolate.

Tablet binders that can be used include acacia, methylcellulose, sodium
carboxymethylcellulose, polyvinylpyrrolidone (Povidone™), hydroxypropyl
methylcellulose, sucrose, starch and ethylcellulose.

Lubricants that can be used include magnesium stearates, stearic acid, silicone
fluid, talc, waxes, oils, and colloidal silica.

Fillers, agents that facilitate disintegration and/or solubilization, tablet binders and 
lubricants, including the aforementioned, can be used singly or in combination. 

Solid oral dosage forms need not be uniform throughout. For example, dragee
cores can be used in conjunction with suitable coatings, such as concentrated sugar
solutions, which can also contain gum arabic, talc, polyvinylpyrrolidone, carbopol gel,
polyethylene glycol, and/or titanium dioxide, lacquer solutions, and suitable organic
solvents or solvent mixtures.

Oral dosage forms of the present invention include push-fit capsules made of
gelatin, as well as soft, sealed capsules made of gelatin and a coating, such as glycerol or
sorbitol. Push-fit capsules can contain active ingredients mixed with a filler or binders,
such as lactose or starches, lubricants, such as talc or magnesium stearate, and, optionally,
stabilizers. In soft capsules, the active compounds can be dissolved or suspended in
suitable liquids, such as fatty oils, liquid, or liquid polyethylene glycol with or without
stabilizers.

Additionally, dyestuffs or pigments can be added to the tablets or dragee coatings
for product identification or to characterize the quantity of active compound, i.e., dosage.

Liquid formulations of the pharmaceutical compositions for oral (enteral)
administration are prepared in water or other aqueous vehicles and can contain various
suspending agents such as methylcellulose, alginates, tragacanth, pectin, kelgin,
carrageenan, acacia, polyvinylpyrrolidone, and polyvinyl alcohol. The liquid formulations
can also include solutions, emulsions, syrups and elixirs containing, together with the
active compound(s), wetting agents, sweeteners, and coloring and flavoring agents.

The pharmaceutical compositions of the present invention can also be formulated 
for parenteral administration. Formulations for parenteral administration can be in the 
form of aqueous or non-aqueous isotonic sterile injection solutions or suspensions. 

For intravenous injection, water soluble versions of the compounds of the present
invention are formulated in, or if provided as a lyophilate, mixed with, a physiologically
acceptable fluid vehicle, such as 5% dextrose ("D5"), physiologically buffered saline,
0.9% saline, Hanks' solution, or Ringer's solution. Intravenous formulations may include
carriers, excipients or stabilizers including, without limitation, calcium, human serum
albumin, citrate, acetate, calcium chloride, carbonate, and other salts.

Intramuscular preparations, e.g., a sterile formulation of a suitable soluble salt form
of the compounds of the present invention, can be dissolved and administered in a
pharmaceutical excipient such as Water-for-Injection, 0.9% saline, or 5% glucose solution.
Alternatively, a suitable insoluble form of the compound can be prepared and
administered as a suspension in an aqueous base or a pharmaceutically acceptable oil base,
such as an ester of a long chain fatty acid (e.g., ethyl oleate), fatty oils such as sesame oil,
triglycerides, or liposomes.

Parenteral formulations of the compositions can contain various carriers such as
vegetable oils, dimethylacetamide, dimethylformamide, ethyl lactate, ethyl carbonate,
isopropyl myristate, ethanol, polyols (glycerol, propylene glycol, liquid polyethylene
glycol, and the like).

Aqueous injection suspensions can also contain substances that increase the
viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or dextran.
Non-lipid polycationic amino polymers can also be used for delivery. Optionally, the
suspension can also contain suitable stabilizers or agents that increase the solubility of the
compounds to allow for the preparation of highly concentrated solutions.

Pharmaceutical compositions of the present invention can also be formulated to
permit injectable, long-term deposition. Injectable depot forms may be made by forming
microencapsulated matrices of the compound in biodegradable polymers such as
polylactide-polyglycolide. Depending upon the ratio of drug to polymer and the nature of
the particular polymer employed, the rate of drug release can be controlled. Examples of
other biodegradable polymers include poly(orthoesters) and poly(anhydrides). Depot
injectable formulations are also prepared by entrapping the drug in microemulsions that
are compatible with body tissues.

The pharmaceutical compositions of the present invention can be administered
topically. For topical use the compounds of the present invention can also be prepared in
suitable forms to be applied to the skin, or mucous membranes of the nose and throat, and
can take the form of lotions, creams, ointments, liquid sprays or inhalants, drops, tinctures,
lozenges, or throat paints. Such topical formulations further can include chemical
compounds such as dimethylsulfoxide (DMSO) to facilitate surface penetration of the
active ingredient. In other transdermal formulations, typically in patch-delivered
formulations, the pharmaceutically active compound is formulated with one or more skin
penetrants, such as 2-N-methyl-pyrrolidone (NMP) or Azone. A topical semi-solid
ointment formulation typically contains a concentration of the active ingredient from
about 1 to 20%, e.g., 5 to 10%, in a carrier such as a pharmaceutical cream base.

For application to the eyes or ears, the compounds of the present invention can be 
presented in liquid or semi-liquid form formulated in hydrophobic or hydrophilic bases as 
ointments, creams, lotions, paints or powders. 

For rectal administration the compounds of the present invention can be
administered in the form of suppositories admixed with conventional carriers such as
cocoa butter, wax or other glyceride.

Inhalation formulations can also readily be formulated. For inhalation, various
powder and liquid formulations can be prepared. For aerosol preparations, a sterile
formulation of the compound or salt form of the compound may be used in inhalers, such
as metered dose inhalers, and nebulizers. Aerosolized forms may be especially useful for
treating respiratory disorders.

Alternatively, the compounds of the present invention can be in powder form for 
reconstitution in the appropriate pharmaceutically acceptable carrier at the time of 
delivery. 

The pharmaceutically active compound in the pharmaceutical compositions of the
present invention can be provided as the salt of a variety of acids, including but not limited
to hydrochloric, sulfuric, acetic, lactic, tartaric, malic, and succinic acid. Salts tend to be
more soluble in aqueous or other protonic solvents than are the corresponding free base
forms.

After pharmaceutical compositions have been prepared, they are packaged in an
appropriate container and labeled for treatment of an indicated condition.

The active compound will be present in an amount effective to achieve the
intended purpose. The determination of an effective dose is well within the capability of
those skilled in the art.

A "therapeutically effective dose" refers to that amount of active ingredient, for
example CSP polypeptide, fusion protein, or fragments thereof, antibodies specific for
CSP, agonists, antagonists or inhibitors of CSP, which ameliorates the signs or symptoms
of the disease or prevents progression thereof; as would be understood in the medical arts,
cure, although desired, is not required.

The therapeutically effective dose of the pharmaceutical agents of the present
invention can be estimated initially by in vitro tests, such as cell culture assays, followed
by assay in model animals, usually mice, rats, rabbits, dogs, or pigs. The animal model
can also be used to determine an initial preferred concentration range and route of
administration.

For example, the ED50 (the dose therapeutically effective in 50% of the
population) and LD50 (the dose lethal to 50% of the population) can be determined in one
or more cell culture or animal model systems. The dose ratio of toxic to therapeutic
effects is the therapeutic index, which can be expressed as LD50/ED50. Pharmaceutical
compositions that exhibit large therapeutic indices are preferred.

The data obtained from cell culture assays and animal studies are used in
formulating an initial dosage range for human use, and preferably provide a range of
circulating concentrations that includes the ED50 with little or no toxicity. After
administration, or between successive administrations, the circulating concentration of
active agent varies within this range depending upon pharmacokinetic factors well known
in the art, such as the dosage form employed, sensitivity of the patient, and the route of
administration.
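
By way of non-limiting illustration, the therapeutic index defined above as LD50/ED50 can be computed as in the following minimal sketch. The function name and the numeric doses are hypothetical and are chosen only to show the arithmetic; they are not data from the present disclosure.

```python
def therapeutic_index(ld50: float, ed50: float) -> float:
    """Return the ratio LD50/ED50.

    Both doses must be expressed in the same units (for example mg/kg)
    and must come from the same model system for the ratio to be meaningful.
    """
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("LD50 and ED50 must be positive")
    return ld50 / ed50


# Hypothetical values for illustration only.
example_index = therapeutic_index(ld50=200.0, ed50=10.0)
print(f"therapeutic index: {example_index:g}")
```

A larger ratio indicates a wider margin between the effective dose and the lethal dose, which is why compositions with large therapeutic indices are preferred.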

The exact dosage will be determined by the practitioner, in light of factors specific
to the subject requiring treatment. Factors that can be taken into account by the
practitioner include the severity of the disease state, general health of the subject, age,
weight, gender of the subject, diet, time and frequency of administration, drug
combination(s), reaction sensitivities, and tolerance/response to therapy. Long-acting
pharmaceutical compositions can be administered every 3 to 4 days, every week, or once
every two weeks depending on half-life and clearance rate of the particular formulation.

Normal dosage amounts may vary from 0.1 to 100,000 micrograms, up to a total
dose of about 1 g, depending upon the route of administration. Where the therapeutic
agent is a protein or antibody of the present invention, the therapeutic protein or antibody
agent typically is administered at a daily dosage of 0.01 mg to 30 mg/kg of body weight of
the patient (e.g., 1 mg/kg to 5 mg/kg). The pharmaceutical formulation can be
administered in multiple doses per day, if desired, to achieve the total desired daily dose.

Guidance as to particular dosages and methods of delivery is provided in the
literature and generally available to practitioners in the art. Those skilled in the art will
employ different formulations for nucleotides than for proteins or their inhibitors.
Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells,
conditions, locations, etc.
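
Purely as an arithmetic illustration of the weight-based daily dosage range recited above, the following sketch converts a per-kilogram range into a total daily amount and divides it across multiple doses per day. The function names and the 70 kg body weight are hypothetical assumptions; the sketch is not dosing guidance, and the exact dosage remains for the practitioner to determine.

```python
def daily_dose_range_mg(weight_kg: float,
                        low_mg_per_kg: float = 0.01,
                        high_mg_per_kg: float = 30.0) -> tuple[float, float]:
    """Scale a per-kilogram daily dosage range to a given body weight.

    The defaults mirror the 0.01 mg/kg to 30 mg/kg range stated in the text.
    """
    if weight_kg <= 0:
        raise ValueError("body weight must be positive")
    if not 0 < low_mg_per_kg <= high_mg_per_kg:
        raise ValueError("dosage range must be positive and ordered")
    return weight_kg * low_mg_per_kg, weight_kg * high_mg_per_kg


def per_administration_mg(total_daily_mg: float, doses_per_day: int) -> float:
    """Split a total daily dose evenly across several administrations."""
    if doses_per_day < 1:
        raise ValueError("at least one dose per day is required")
    return total_daily_mg / doses_per_day


# Hypothetical 70 kg subject and the narrower 1 to 5 mg/kg example range.
low_mg, high_mg = daily_dose_range_mg(70.0, 1.0, 5.0)
print(f"daily range: {low_mg:g} mg to {high_mg:g} mg")
print(f"per dose at two doses per day: {per_administration_mg(high_mg, 2):g} mg")
```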

Conventional methods, known to those of ordinary skill in the art of medicine, can 
be used to administer the pharmaceutical formulation(s) of the present invention to the 
patient. The pharmaceutical compositions of the present invention can be administered 
alone, or in combination with other therapeutic agents or interventions. 






Therapeutic Methods 

The present invention further provides methods of treating subjects having defects
in a gene of the invention, e.g., in expression, activity, distribution, localization, and/or
solubility, which can manifest as a disorder of colon function. As used herein, "treating"
includes all medically-acceptable types of therapeutic intervention, including palliation
and prophylaxis (prevention) of disease. The term "treating" encompasses any
improvement of a disease, including minor improvements. These methods are discussed
below.

Gene Therapy and Vaccines 

The isolated nucleic acids of the present invention can also be used to drive in vivo
expression of the polypeptides of the present invention. In vivo expression can be driven
from a vector, typically a viral vector, often a vector based upon a replication incompetent
retrovirus, an adenovirus, or an adeno-associated virus (AAV), for the purpose of gene
therapy. In vivo expression can also be driven from signals endogenous to the nucleic acid
or from a vector, often a plasmid vector, such as pVAX1 (Invitrogen, Carlsbad, CA,
USA), for the purpose of "naked" nucleic acid vaccination, as further described in U.S.
Patent Nos. 5,589,466; 5,679,647; 5,804,566; 5,830,877; 5,843,913; 5,880,104; 5,958,891;
5,985,847; 6,017,897; 6,110,898; and 6,204,250, the disclosures of which are incorporated
herein by reference in their entireties. For cancer therapy, it is preferred that the vector
also be tumor-selective. See, e.g., Doronin et al., J. Virol. 75: 3314-24 (2001).

In another embodiment of the therapeutic methods of the present invention, a
therapeutically effective amount of a pharmaceutical composition comprising a nucleic
acid molecule of the present invention is administered. The nucleic acid molecule can be
delivered in a vector that drives expression of a CSP, fusion protein, or fragment thereof,
or without such vector. Nucleic acid compositions that can drive expression of a CSP are
administered, for example, to complement a deficiency in the native CSP, or as DNA
vaccines. Expression vectors derived from viruses, such as replication deficient
retroviruses, adenovirus, adeno-associated virus (AAV), herpes virus, or vaccinia virus,
can be used, as can plasmids. See, e.g., Cid-Arregui, supra. In a preferred embodiment,
the nucleic acid molecule encodes a CSP having the amino acid sequence of SEQ ID NO:
113-259, or a fragment, fusion protein, allelic variant or homolog thereof.

In still other therapeutic methods of the present invention, pharmaceutical
compositions comprising host cells that express a CSP, fusions, or fragments thereof can
be administered. In such cases, the cells are typically autologous, so as to circumvent
xenogeneic or allotypic rejection, and are administered to complement defects in CSP
production or activity. In a preferred embodiment, the nucleic acid molecules in the cells
encode a CSP having the amino acid sequence of SEQ ID NO: 113-259, or a fragment,
fusion protein, allelic variant or homolog thereof.

Antisense Administration

Antisense nucleic acid compositions, or vectors that drive expression of a CSG
antisense nucleic acid, are administered to downregulate transcription and/or translation of
a CSG in circumstances in which excessive production, or production of aberrant protein,
is the pathophysiologic basis of disease.

Antisense compositions useful in therapy can have a sequence that is
complementary to coding or to noncoding regions of a CSG. For example,
oligonucleotides derived from the transcription initiation site, e.g., between positions -10
and +10 from the start site, are preferred.
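
As a non-limiting sketch of how an oligonucleotide complementary to the -10 to +10 window might be derived in silico, the following example takes the reverse complement of the sense-strand window around the start site. The function names, the coordinate convention (position +1 is the start site and there is no position 0, giving a 20-nucleotide window), and the toy sequence are all assumptions made for illustration; the sequence is not a CSG sequence.

```python
# Map each DNA base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an unambiguous DNA sequence."""
    dna = dna.upper()
    unexpected = set(dna) - set("ACGT")
    if unexpected:
        raise ValueError(f"unexpected characters: {sorted(unexpected)}")
    return dna.translate(_COMPLEMENT)[::-1]


def antisense_window(sense_strand: str, start_index: int,
                     upstream: int = 10, downstream: int = 10) -> str:
    """Return an antisense oligo covering positions -upstream..+downstream.

    start_index is the 0-based index of position +1 in sense_strand.
    """
    begin = start_index - upstream
    end = start_index + downstream
    if begin < 0 or end > len(sense_strand):
        raise ValueError("window extends beyond the supplied sequence")
    return reverse_complement(sense_strand[begin:end])


# Toy sequence: ten A's upstream of the start site, ten C's from +1 onward.
toy_sense = "A" * 10 + "C" * 10
print(antisense_window(toy_sense, start_index=10))
```

The returned string is written 5' to 3', so it can hybridize to the sense-strand window from which it was derived.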

Catalytic antisense compositions, such as ribozymes, that are capable of
sequence-specific hybridization to CSG transcripts, are also useful in therapy. See, e.g.,
Phylactou, Adv. Drug Deliv. Rev. 44(2-3): 97-108 (2000); Phylactou et al., Hum. Mol.
Genet. 7(10): 1649-53 (1998); Rossi, Ciba Found. Symp. 209: 195-204 (1997); and
Sigurdsson et al., Trends Biotechnol. 13(8): 286-9 (1995).

Other nucleic acids useful in the therapeutic methods of the present invention are
those that are capable of triplex helix formation in or near the CSG genomic locus. Such
triplexing oligonucleotides are able to inhibit transcription. See, e.g., Intody et al., Nucleic
Acids Res. 28(21): 4283-90 (2000); and McGuffie et al., Cancer Res. 60(14): 3790-9
(2000). Pharmaceutical compositions comprising such triplex forming oligos (TFOs) are
administered in circumstances in which excessive production, or production of aberrant
protein, is a pathophysiologic basis of disease.

In a preferred embodiment, the antisense molecule is derived from a nucleic acid
molecule encoding a CSP, preferably a CSP comprising an amino acid sequence of SEQ
ID NO: 113-259, or a fragment, allelic variant or homolog thereof. In a more preferred
embodiment, the antisense molecule is derived from a nucleic acid molecule having a
nucleotide sequence of SEQ ID NO: 1-112, or a part, allelic variant, substantially similar
or hybridizing nucleic acid thereof.



Polypeptide Administration

In one embodiment of the therapeutic methods of the present invention, a
therapeutically effective amount of a pharmaceutical composition comprising a CSP, a
fusion protein, fragment, analog or derivative thereof is administered to a subject with a
clinically-significant CSP defect.

Protein compositions are administered, for example, to complement a deficiency in
native CSP. In other embodiments, protein compositions are administered as a vaccine to
elicit a humoral and/or cellular immune response to CSP. The immune response can be
used to modulate activity of CSP or, depending on the immunogen, to immunize against
aberrant or aberrantly expressed forms, such as mutant or inappropriately expressed
isoforms. In yet other embodiments, protein fusions having a toxic moiety are
administered to ablate cells that aberrantly accumulate CSP.

In a preferred embodiment, the polypeptide administered is a CSP comprising an
amino acid sequence of SEQ ID NO: 113-259, or a fusion protein, allelic variant,
homolog, analog or derivative thereof. In a more preferred embodiment, the polypeptide
is encoded by a nucleic acid molecule having a nucleotide sequence of SEQ ID NO:
1-112, or a part, allelic variant, substantially similar or hybridizing nucleic acid thereof.

Antibody, Agonist and Antagonist Administration 

In another embodiment of the therapeutic methods of the present invention, a
therapeutically effective amount of a pharmaceutical composition comprising an antibody
(including fragment or derivative thereof) of the present invention is administered. As is
well known, antibody compositions are administered, for example, to antagonize activity
of CSP, or to target therapeutic agents to sites of CSP presence and/or accumulation. In a
preferred embodiment, the antibody specifically binds to a CSP comprising an amino acid
sequence of SEQ ID NO: 113-259, or a fusion protein, allelic variant, homolog, analog or
derivative thereof. In a more preferred embodiment, the antibody specifically binds to a
CSP encoded by a nucleic acid molecule having a nucleotide sequence of SEQ ID NO:
1-112, or a part, allelic variant, substantially similar or hybridizing nucleic acid thereof.

The present invention also provides methods for identifying modulators which
bind to a CSP or have a modulatory effect on the expression or activity of a CSP.
Modulators which decrease the expression or activity of CSP (antagonists) are believed to
be useful in treating colon cancer. Such screening assays are known to those of skill in the
art and include, without limitation, cell-based assays and cell-free assays. Small molecules
predicted via computer imaging to specifically bind to regions of a CSP can also be
designed, synthesized and tested for use in the imaging and treatment of colon cancer.
Further, libraries of molecules can be screened for potential anticancer agents by assessing
the ability of the molecule to bind to the CSPs identified herein. Molecules identified in
the library as being capable of binding to a CSP are key candidates for further evaluation
for use in the treatment of colon cancer. In a preferred embodiment, these molecules will
downregulate expression and/or activity of a CSP in cells.

In another embodiment of the therapeutic methods of the present invention, a
pharmaceutical composition comprising a non-antibody antagonist of CSP is administered.
Antagonists of CSP can be produced using methods generally known in the art. In
particular, purified CSP can be used to screen libraries of pharmaceutical agents, often
combinatorial libraries of small molecules, to identify those that specifically bind and
antagonize at least one activity of a CSP.

In other embodiments a pharmaceutical composition comprising an agonist of a
CSP is administered. Agonists can be identified using methods analogous to those used to
identify antagonists.

In a preferred embodiment, the antagonist or agonist specifically binds to and
antagonizes or agonizes, respectively, a CSP comprising an amino acid sequence of SEQ
ID NO: 113-259, or a fusion protein, allelic variant, homolog, analog or derivative thereof.
In a more preferred embodiment, the antagonist or agonist specifically binds to and
antagonizes or agonizes, respectively, a CSP encoded by a nucleic acid molecule having a
nucleotide sequence of SEQ ID NO: 1-112, or a part, allelic variant, substantially similar
or hybridizing nucleic acid thereof.

Targeting Colon Tissue

The invention also provides a method in which a polypeptide of the invention, or
an antibody thereto, is linked to a therapeutic agent such that it can be delivered to the
colon or to specific cells in the colon. In a preferred embodiment, an anti-CSP antibody is
linked to a therapeutic agent and is administered to a patient in need of such therapeutic
agent. The therapeutic agent may be a toxin, if colon tissue needs to be selectively
destroyed. This would be useful for targeting and killing colon cancer cells. In another
embodiment, the therapeutic agent may be a growth or differentiation factor, which would
be useful for promoting colon cell function.




In another embodiment, an anti-CSP antibody may be linked to an imaging agent
that can be detected using, e.g., magnetic resonance imaging, CT or PET. This would be
useful for determining and monitoring colon function, identifying colon cancer tumors,
and identifying noncancerous colon diseases.

EXAMPLES

Example 1a: Alternative Splice Variants

We identified gene transcripts using the Gencarta™ tools (Compugen Ltd., Tel
Aviv, Israel) and a variety of public and proprietary databases. These splice variants are
either sequences which differ from a previously defined sequence or new uses of known
sequences. In general, related variants are annotated as DEX0449_XXX.nt.1,
DEX0449_XXX.nt.2, DEX0449_XXX.nt.3, etc. The variant DNA sequences encode
proteins which differ from a previously defined protein sequence. In relation to the
nucleotide sequence naming convention, protein variants are annotated as
DEX0449_XXX.aa.1, DEX0449_XXX.aa.2, etc., wherein transcript DEX0449_XXX.nt.1
encodes protein DEX0449_XXX.aa.1. A single transcript may encode a protein from an
alternate Open Reading Frame (ORF), which is designated DEX0449_XXX.orf.1.
Additionally, multiple transcripts may encode a single protein. In this case,
DEX0449_XXX.nt.1 and DEX0449_XXX.nt.2 will both be associated with
DEX0449_XXX.aa.1.
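
The naming convention described above can be illustrated with the following minimal sketch, which splits an identifier into its parts and derives the protein identifier that a transcript encodes by default. The regular expression, the class and function names, and the assumption that the XXX field is always three digits are hypothetical; as noted above, several transcripts may map to one protein, so the default mapping is only a starting point and the table of SEQ ID NOs remains authoritative.

```python
import re
from typing import NamedTuple

# Assumed identifier shape: DEX<4 digits>_<3 digits>.<nt|aa|orf>.<variant number>
_DEX_ID = re.compile(r"^(DEX\d{4})_(\d{3})\.(nt|aa|orf)\.(\d+)$")


class DexId(NamedTuple):
    family: str   # e.g. "DEX0449"
    gene: str     # the XXX field, e.g. "003"
    kind: str     # "nt" (transcript), "aa" (protein) or "orf" (alternate ORF)
    variant: int  # variant number within the gene


def parse_dex_id(text: str) -> DexId:
    """Parse an identifier such as 'DEX0449_003.orf.4'."""
    match = _DEX_ID.match(text.strip())
    if match is None:
        raise ValueError(f"not a DEX identifier: {text!r}")
    family, gene, kind, variant = match.groups()
    return DexId(family, gene, kind, int(variant))


def default_protein_id(transcript_id: str) -> str:
    """Return the .aa identifier with the same variant number as a .nt identifier."""
    parsed = parse_dex_id(transcript_id)
    if parsed.kind != "nt":
        raise ValueError("expected a transcript (.nt) identifier")
    return f"{parsed.family}_{parsed.gene}.aa.{parsed.variant}"


print(parse_dex_id("DEX0449_003.orf.4"))
print(default_protein_id("DEX0449_001.nt.1"))
```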

The mapping of the nucleic acid ("NT") SEQ ID NO; DEX ID; chromosomal
location (if known); open reading frame (ORF) location; amino acid ("AA") SEQ ID NO;
and AA DEX ID is shown in the table below.

The polypeptides of the present invention were analyzed and the following
attributes were identified: specifically, epitopes, post translational modifications, signal
peptides and transmembrane domains. Antigenicity (Epitope) prediction was performed
through the antigenic module in the EMBOSS package. Rice, P., EMBOSS: The
European Molecular Biology Open Software Suite, Trends in Genetics 16(6): 276-277
(2000). The antigenic module predicts potentially antigenic regions of a protein sequence,
using the method of Kolaskar and Tongaonkar. Kolaskar, AS and Tongaonkar, PC, A
semi-empirical method for prediction of antigenic determinants on protein antigens, FEBS
Letters 276: 172-174 (1990). Examples of post-translational modifications (PTMs) and
other motifs of the CSPs of this invention are listed below. In addition, antibodies that
specifically bind such post-translational modifications may be useful as diagnostics or as
therapeutics. The PTMs and other motifs of the CSPs of the invention were predicted by
using the ProSite Dictionary of Protein Sites and Patterns (Bairoch et al., Nucleic Acids
Res. 25(1): 217-221 (1997)). The signal peptides were detected by using SignalP 2.0; see
Nielsen et al., Protein Engineering 12: 3-9 (1999). Prediction of transmembrane helices in
proteins was performed by the application TMHMM 2.0, "currently the best performing
transmembrane prediction program", according to its authors (Krogh et al., Journal of
Molecular Biology 305(3): 567-580 (2001); Moller et al., Bioinformatics 17(7): 646-653
(2001); Sonnhammer et al., A hidden Markov model for predicting transmembrane helices
in protein sequences, in Glasgow et al. (eds.), Proceedings of the Sixth International
Conference on Intelligent Systems for Molecular Biology, pages 175-182, Menlo Park,
CA, 1998, AAAI Press). The PSORT II program may also be used to predict cellular
localizations. Horton et al., Intelligent Systems for Molecular Biology 5: 147-152 (1997).

The table below includes the following sequence annotations: Signal peptide presence;
TM (number of transmembrane domains, topology by orientation and position); Amino acid
location and antigenic index (location, AI score); PTM and other motifs (type, amino acid
residue locations); and functional domains (type, amino acid residue locations).
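
To make the "type, amino acid residue locations" notation concrete, the following minimal sketch scans a protein sequence for four of the short ProSite-style motifs that appear in the table and reports 1-based inclusive residue ranges. It is not the ProSite software used to generate the table. The regular expressions are transcribed from the corresponding ProSite patterns as an assumption and should be checked against the database release in use; the function name and the test peptides are hypothetical.

```python
import re

# ProSite-style patterns rewritten as regular expressions (assumed transcriptions):
#   MYRISTYL          G-{EDRKHPFYW}-x(2)-[STAGCN]-{P}
#   PKC_PHOSPHO_SITE  [ST]-x-[RK]
#   CK2_PHOSPHO_SITE  [ST]-x(2)-[DE]
#   ASN_GLYCOSYLATION N-{P}-[ST]-{P}
MOTIFS = {
    "MYRISTYL": r"G[^EDRKHPFYW]..[STAGCN][^P]",
    "PKC_PHOSPHO_SITE": r"[ST].[RK]",
    "CK2_PHOSPHO_SITE": r"[ST]..[DE]",
    "ASN_GLYCOSYLATION": r"N[^P][ST][^P]",
}


def scan_motifs(protein: str) -> list[tuple[str, int, int]]:
    """Return (motif, start, end) hits with 1-based inclusive coordinates.

    A zero-width lookahead is used so that overlapping hits are all reported.
    """
    protein = protein.upper()
    hits = []
    for name, pattern in MOTIFS.items():
        for match in re.finditer(f"(?=({pattern}))", protein):
            start = match.start() + 1
            end = match.start() + len(match.group(1))
            hits.append((name, start, end))
    return sorted(hits, key=lambda hit: (hit[1], hit[0]))


# Hypothetical peptides, each built to contain exactly one motif.
for peptide in ("GAAASA", "SAR", "TAAE", "NGTA"):
    for name, start, end in scan_motifs(peptide):
        print(f"{peptide}: {name} {start}-{end};")
```

Each printed hit follows the same "TYPE start-end;" layout as the PTM entries in the table.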



DEX ID: DEX0449_001.aa.1
Sig: N
TMHMM: 0 - o1-165;
Antigenicity: 51-59,1.044; 98-105,1.041; 124-130,1.064; 65-70,1.029; 143-162,1.163; 83-96,1.122; 108-116,1.104;
PTM: MYRISTYL 9-14; MYRISTYL 44-49; MYRISTYL 12-17; CK2_PHOSPHO_SITE 94-97; MYRISTYL 73-78; MYRISTYL 40-45; MYRISTYL 77-82; PKC_PHOSPHO_SITE 118-120; MYRISTYL 17-22; PKC_PHOSPHO_SITE 107-109; CK2_PHOSPHO_SITE 87-90; MYRISTYL 92-97; MYRISTYL 16-21;
Domains: RRM 82-152; rrm 84-154; RRM 83-155; GLY_RICH 1-82;

DEX ID: DEX0449_001.orf.1
Sig: N
TMHMM: 0 - o1-164;
Antigenicity:
PTM: MYRISTYL 72-77; MYRISTYL 15-20; MYRISTYL 91-96; PKC_PHOSPHO_SITE 106-108; CK2_PHOSPHO_SITE 86-89; MYRISTYL 39-44; MYRISTYL 16-21; MYRISTYL 43-48; MYRISTYL 11-16; CK2_PHOSPHO_SITE 93-96; MYRISTYL 76-81; PKC_PHOSPHO_SITE 117-119;
Domains: RRM 82-154; RRM 81-151; GLY_RICH 1-81; rrm 83-153;

DEX ID: DEX0449_002.aa.1
Sig: N
TMHMM: 0 - o1-256;
Antigenicity: 134-156,1.159; 116-125,1.111; 24-34,1.087; 186-196,1.13; 71-101,1.22; 203-226,1.227; 164-180,1.143; 237-245,1.093;
PTM: PKC_PHOSPHO_SITE 71-73; MYRISTYL 199-204; CK2_PHOSPHO_SITE 128-131; MYRISTYL 49-54; MYRISTYL 233-238; MYRISTYL 134-139; PKC_PHOSPHO_SITE 34-36; MYRISTYL 161-166;
Domains: PRO_RICH 80-129;


DEX0449 00 


¥ 


0 - ol- 


158- 


MYRISTYL 176-181; 


PRO RICH 122-171; 
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2 . orf . 1 



DEX0449_00 
3 ,aa. 1 



250; 



N 



167, 1.111; 

66-76,1.087; 

113- 

143,1.22; 
176- 

198,1.159; 

9-36,1.178; 

206- 

222, 1.143 ; 
228- 

238,1.13; 



0 - ol 
406; 



j[PKC_PHOSPHO_S ITE 
76-78; MYRISTYL 
91-96; 

PKC_PHOSPHO_SITE 
4-6; MYRISTYL 
241-246; MYRISTYL 1 
203-208; 

CK2_PHOSPHO_SITE 
170-173; 

PKC_PHOSPHO_SITE 
3-5; 

PKC_PHOSPHO_SITE 
113-115; 



82-97, 


1.113; 


201- 




207,1. 


14; 


59-75,1.139; 


124- 




130,1, 


123; 


213- 




226, 1 


126; 


142- 




174,1 


207; 


356- 




375,1 


.115; 


251- 




278,1 


.154; 


388- 




403,1 


.127; 


334- 




340,1 


.063; 


323- 




332,1 


.205; 


182- 




191,1 


.186; 


105- 




121,1 


.136; 


4-11, 


1.111; 


239- 




248,1 


.148; 


289- 




300,1 


.104; 


20-26 


,1.083; 


450- 




457,1 


.192; 


411- 




418, 1 


.091; 


124- 




130,1 


.123; 


347- 




373, 1 


.136; 


20-26 


,1.083; 


490- 




502, 1 


.207; 


182- 




191,1 


.186; 


289- 




300,1 


.104; 


213- 





MYRISTYL 335-340; 

CK2J?HOSPHO_SITE 

30-33; 

CK2_PHOSPHO_SITE 
49-52; 

PKC_PHOSPHO_SITB 
120-122; 

CK2_PHOSPHO_SITE 
89-92; MYRISTYL 
198-203; 

PKC_PHOSPHO_SITE 
13-15; 

PKC_PHOSPHO_SITE 
49-51; 

CK2_PHOSPHO_SITE 
88-91; 

PKC_PHOSPHO_SITE 
79-81; 

PKC_PHOSPHO_SITE 
344-346; MYRISTYL 
189-194; 

ASN_GLYCOSYLATION 
379-382; 



Glyco_hydro_8 5 131 
385; 



DEX0449_00 
3 . aa . 3 



N 



0 - ol 
525; 



PKC_PHOSPHO_SITE 
79-81; MYRISTYL 
198-203; 

CK2_PHOSPHO_SITB 
88-91; MYRISTYL 
189-194; 

PKC_PHOSPHO_SITE 
344-346; 

PKC_PHOSPHO_SITE 
120-122; 

PKC_PHOSPHO_S ITE 
13-15; MYRISTYL 
335-340; MYRISTYL 
486-491; 

CK2_PHOSPHO_SITE 
49-52: 



BRCT 337-383; 
Glyco__hydro_85 131 
414; 
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226, 1.126; 


^K2_PHOSPHO_SITE 










105- 


428-431; 










121,1.136; 


PKC_PHOSPHO_SITE 










239- 


49-51; 










248, 1- 148; 


CK2_PHOSPHO_SITE 










251- 


30-33; MYRISTYL 










278,1.154; 


416-421; 










142- 


CK2_PHOSPHO_SITB 










174,1.207;


89-92; 










82-97,1.113; 












59-75,1.139; 












334- 












340,1.063; 












462- 












483,1.166; 












422- 












438,1.104; 












323- 












332, 1.205; 












201- 












207,1.14; 4- 












11,1.111; 












387- 












408, 1.195; 












11-18,1.089; 
270- 

289, 1.165; 


CK2_PHOSPHO_SITE 
175-178; MYRISTYL
37-42; 

CK2_PHOSPHO_SITE
216-219; MYRISTYL
272-277;

PKC_PHOSPHO_SITE 
182-184; 

PKC_PHOSPHO_SITE 










90-97,1.156; 
134- 

143,1.053; 
29-36,1.1; 
60-86,1.143; 
44-57, 1.084; 
317- 

324, 1.169; 
223- 

250,1.179; 
120- 

132,1.108; 
146- 

171,1.237; 
298- 

314,1.155; 
182- 

214,1.168; 
102-115,1.1; 




DEX0449_003.aa.4

N

0 - o1-327;


218-220; 

CK2_PHOSPHO_SITE 
218-221; 

PKC_PHOSPHO_SITE 

1 O O "1 O A • 

LZA - x z*± ; 

FlsX* jrrlU oirrlL/ o± 1x5 










1 C C ICQ. 

L-iN^s irxlvJo fxlw o ± lis 

98-101;

PKC_PHOSPHO_SITE 
118-120; MYRISTYL 
9-14; 










119- 


PKC_PHOSPHO_SITE 










143,1.143; 


90-92; 










355- 


PKC_PHOSPHO_SITE 










371,1.155; 


8-10; 










280- 


CK2_PHOSPHO_SITE 




DEX0449_003.orf.4

N

0 - o1-384;


307,1.179; 
177- 

189,1.108; 
95- 

112,1.089; 
327- 

346,1.165; 
374- 


273-276; 

PKC_PHOSPHO_SITE 
175-177; 

CK2_PHOSPHO_SITE
232-235; 

PKC_PHOSPHO_SITE
179-181; MYRISTYL
329-334;
381,1.169;

85-90,1.061; 

239- 

271,1.168; 
191- 

200,1.053;

159-172,1.1; 

203- 

228,1.237;
147- 

154,1.156; 
4-64,1.135; 


CK2_PHOSPHO_SITE
155-158; 

CK2_PHOSPHO_SITE 
275-278; MYRISTYL
94-99; 

PKC_PHOSPHO_SITE 
239-241; MYRISTYL 
81-86; 

PKC_PHOSPHO_SITE 
275-277; 

CK2_PHOSPHO_SITE
90-93; MYRISTYL 
47-52; 

PKC_PHOSPHO_SITE 
213-215; 








DEX0449_003.orf.6

N

0 - o1-450;


147- 

154,1.156; 
95- 

112, 1.089; 
255- 

269,1.178; 

159-172,1.1; 

421- 

437,1.155; 
292- 

337,1.244; 

4-64,1.135; 

440- 

447,1.169; 
239- 

250, 1.08; 
393- 

412, 1.165; 
177- 

189,1.108; 
191- 

200,1.053;

85-90,1.061; 

119- 

143, 1.143 ; 
346- 

373, 1.179; 
271- 

278,1.063; 
203- 

228,1.237; 


CK2_PHOSPHO_SITE 
155-158; 

PKC_PHOSPHO_SITE
90-92; 

CK2_PHOSPHO_SITE 
90-93; 

CK2_PHOSPHO_SITE
232-235; 

PKC_PHOSPHO_SITE 
8-10; MYRISTYL 
47-52; MYRISTYL 
252-257; MYRISTYL 
287-292; 

PKC_PHOSPHO_SITE
179-181; MYRISTYL 
395-400; 

PKC_PHOSPHO_SITE 
341-343; 

CK2_PHOSPHO_SITE 
341-344; 

PKC_PHOSPHO_SITE 
213-215; 

CK2_PHOSPHO_SITE 
339-342; 

PKC_PHOSPHO_SITE
239-241; MYRISTYL 
81-86; MYRISTYL 
94-99; 

PKC_PHOSPHO_SITE
175-177; 




DEX0449_003.aa.7

N

0 - o1-302;


82-97, 1.113; 
292- 

299,1.169; 
273- 

289,1.155; 
20-26, 1.083; 
124- 

130,1.123; 
105- 

121,1.136; 
239- 

248, 1.148; 
182- 


PKC_PHOSPHO_SITE 
79-81; 

CK2_PHOSPHO_SITE 
30-33; MYRISTYL 
189-194; 

PKC_PHOSPHO_SITE
120-122; MYRISTYL 
198-203; 

CK2_PHOSPHO_SITE 
49-52; 

PKC_PHOSPHO_SITE
49-51; 

CK2_PHOSPHO_SITE


Glyco_hydro_85 131-297;
1 :


191,1.186;


88-91;










251-


PKC_PHOSPHO_SITE










264,1.138; 


13-15;










201-


CK2_PHOSPHO_SITE










207,1.14; 


89-92;










213- 












226, 1.126 ; 












59-75,1.139; 












4-11,1.111; 












142- 












174,1.207;














PKC_PHOSPHO_SITE 








1 - o1-


4-19,1.198; 


87-89; 




TM-iV /\ A A C\ A A 

DEX0449_004.aa.1


y 


62;tm63-85;i86-162;


55-67,1.104; 

22-48,1.14; 

69- 

116,1.236; 


CK2_PHOSPHO_SITE
98-101; 

CK2_PHOSPHO_SITE 
135-138; MYRISTYL 
128-133; 


Sterol_desat 1-129;
SUR2_DOMAIN 18-126;










CK2_PHOSPHO_SITE










186- 


100-103; 










197, 1.16; 


PKC_PHOSPHO_SITE 










36-45,1.101; 


7-9; 










97- 


CK2_PHOSPHO_SITE




DEX0449_005.aa.1

N

0 - o1-206;


119,1.199; 
163- 

173,1.076; 
142- 

147,1.042; 
8-31, 1.142; 
58-76,1.16; 


42-45; 

PKC_PHOSPHO_SITE 
194-196; 

PKC_PHOSPHO_SITE
42-44; MYRISTYL 
75-80; 

PKC_PHOSPHO_SITE 
183-185; 










156- 












178,1.199;


MYRISTYL 261-266; 










349- 


PKC_PHOSPHO_SITE










355,1.062;


101-103; 










243- 


CK2_PHOSPHO_SITE










259,1.168; 


40-43; MYRISTYL 










117- 


52-57; 










135,1.16; 


PKC_PHOSPHO_SITE 










95- 


310-312; 










104, 1.101; 


PKC_PHOSPHO_SITE 










4-10,1.175; 


332-334; 










275- 


CK2_PHOSPHO_SITE










293, 1.148; 


333-336; 




DEX0449_005.orf.1

N

0 - o1-380;


360- 

371,1.088; 

43-54,1.098; 

315- 

322, 1.094; 
217- 

235, 1.131; 
296- 

304, 1.127; 

17-31,1.101; 

261- 

267,1.072; 
201- 

206, 1.042; 
67-90,1.142; 


CK2_PHOSPHO_SITE 
101-104; 

PKC_PHOSPHO_SITE 
66-68; 

CK2_PHOSPHO_SITE
310-313; 

CK2_PHOSPHO_SITE 
302-305; MYRISTYL
134-139; 

CK2_PHOSPHO_SITE 
159-162; 

PKC_PHOSPHO_SITE 
9-11; MYRISTYL 
221-226; 


i 
DEX0449_005.orf.2



N 



DEX0449_005.aa.3



217- 

235,1.131; 
296- 

304,1.127; 
156- 

178,1.199; 
95- 

104,1.101; 
349- 

355,1.062; 
67-90,1.142; 
17-31,1.101; 
201-206,1.042;
o1-380;
243-259,1.168;
261-267,1.072;
43-54,1.098; 
315- 

322,1.094; 
4-10,1.175; 
275- 

293,1.148; 
360- 

371,1.088; 
117- 

135,1.16; 



N 



334; 



DEX0449_005.orf.3



Ol- 



0 - o1-327;



PKC_PHOSPHO_SITE 
101-103; 

PKC_PHOSPHO_SITE
66-68; 

CK2_PHOSPHO_SITE 
310-313; 

CK2_PHOSPHO_SITE 
302-305; MYRISTYL 
261-266; 

PKC_PHOSPHO_SITE 
310-312; MYRISTYL 
134-139; 

CK2_PHOSPHO_SITE 
159-162; MYRISTYL 
52-57; 

PKC_PHOSPHO_SITE 
9-11; 

CK2_PHOSPHO_SITE 
101-104; 

CK2_PHOSPHO_SITE 
40-43; 

CK2_PHOSPHO_SITE
333-336; 

PKC_PHOSPHO_SITE
332-334; MYRISTYL 
221-226; 



94-105,1.164;
46-52,1.071;
114-124,1.143;
270-275,1.042;
225-247,1.199;
314-325,1.16;
186-204,1.16;
291-301,1.076;
136-159,1.142;
10-16,1.095;
18-37,1.212;
164-173,1.101;
76-86,1.121;
59-65,1.085;



307-318,1.088;
64-82,1.16;
243-251,1.127;
262-269,1.094;

CAMP_PHOSPHO_SITE
73-76; MYRISTYL
95-100;

PKC_PHOSPHO_SITE
135-137;

CK2_PHOSPHO_SITE
170-173; MYRISTYL
85-90;

PKC_PHOSPHO_SITE
170-172;

PKC_PHOSPHO_SITE
322-324; MYRISTYL
71-76; MYRISTYL
51-56; MYRISTYL
31-36;

PKC_PHOSPHO_SITE
311-313;

PKC_PHOSPHO_SITE
28-30; MYRISTYL
82-87; MYRISTYL
46-51; MYRISTYL
203-208;

CK2_PHOSPHO_SITE
228-231;



MYRISTYL 168-173;
CK2_PHOSPHO_SITE 
257-260; MYRISTYL 
81-86; 

PKC_PHOSPHO_SITE 
257-259; 
CK2_PHOSPHO_SITE



J 
1 CA

Xo4 - 


^ov-283; MYRISTYL 










1QO "1 m . 












1 AO — 












loo / X ■ u*± *s ; 


A Q CI. 










AO c i i i ni • 
*± z - o x , l . i u i ; 














XUO-XQ9 ; 










302 1 062 • 


PVO PHOSPHO STTE 










222- 


3-6; 










240, 1.148; 


CK2_PHOSPHO_SITE 










14-37,1.142;


249-252; 










208- 


PKC_PHOSPHO_SITE 










214, 1.072; 


48-50; 










190- 


PKC_PHOSPHO_SITE 










206, 1.168; 


279-281; 










103- 


PKC_PHOSPHO_SITE 










125,1.199; 


13-15; 












PKC_PHOSPHO_SITE












42-44; 












ASN_GLYCOSYLATION










x • xux; 


1 CA 1 C *7 . MVOTOnTVT 










142- 


75-80; 




DEX0449_005.aa.4

N

0 - o1-161;


147,1.042;
58-76,1.16;
97-119,1.199;
8-31,1.142;


CK2_PHOSPHO_SITE
42-45;

PKC_PHOSPHO_SITE
156-158;
PKC_PHOSPHO_SITE
7-9;

CK2_PHOSPHO_SITE
100-103;










156 - 












178,1.199;












217- 












235,1.131; 


MYRISTYL 261-266; 










43-54,1.098; 


PKC_PHOSPHO_SITE 










67-90,1.142;


9-11; MYRISTYL 










296- 


221-226; 










314,1.123; 


PKC_PHOSPHO_SITE










261- 


66-68; 










267,1.072; 


CK2_PHOSPHO_SITE




DEX0449_00 


N 


0 - ol- 


117- 


101-104; MYRISTYL 




5.orf.6


326; 


135,1.16; 
275-293,1.148;
201- 

206,1.042; 
243- 

259,1.168; 
17-31,1.101; 
4-10, 1.175; 
95- 

104,1.101; 


134-139; MYRISTYL 
j<6 d / ; 

PKC_PHOSPHO_SITE
101-103; 

CK2_PHOSPHO_SITE 
40-43;

CK2_PHOSPHO_SITE
159-162; 




DEX0449_006.aa.1

N

0 - o1-216;


23-80,1.213; 
88-99,1.097; 
123- 

140,1.268; 
9-15, 1.084; 
109- 

116,1.12; 
142- 


CK2_PHOSPHO_SITE
89-92; 

PKC_PHOSPHO_SITE 
99-101; MYRISTYL 
84-89; 

CAMP_PHOSPHO_SITE 
79-82; 


PRO_RICH 31-209; 
207,1.19;






DEX0449_007.aa.1


sr 


0 - o1-108;


4-10,1.067; 

46-52,1.055; 

12-30,1.269; 

59-105,1.18; 

39-44,1.054; 


PKC_PHOSPHO_SITE 

PKC_PHOSPHO_SITE 
29-31; 


SMALLCYTKCXC 75-83;
SMALLCYTKCXC 90- 
108; 


DEX0449_007.orf.1

Y

0 - o1-142;


67-73,1.058; 
46-58,1.043; 
93-139,1.18;
4-30,1.163; 
37-44,1.048; 


PKC_PHOSPHO_SITE
10-12; MYRISTYL 
7-12; 


SMALLCYTKCXC 124- 
142; INTERLEUKIN8 
125-142; 

INTERLEUKIN8 102- 
125; SMALLCYTKCXC 
109-117; 


DEX0449_008.aa.1

N

0 - o1-482;


298- 

306,1.134; 
176- 

183,1.113; 

16-33,1.107; 

325- 

341,1.104; 

38-45,1.113; 

373- 

394,1.127; 
285- 

294,1.256; 
257- 

275,1.079; 
354- 

361,1.095; 
230- 

254,1.098; 

52-58,1.067; 

121- 

128,1.113; 

7-14,1.136; 

110- 

119,1.111; 
62- 

101,1.184; 


MYRISTYL 44-49; 
ASN_GLYCOSYLATION
317-320; 

PKC_PHOSPHO_SITE 
48-50; 

ASN_GLYCOSYLATION 
285-288; MYRISTYL 
43-48; 

CK2_PHOSPHO_SITE
141-144; 

TYR_PHOSPHO_SITE 
50-56; MYRISTYL 
96-101; 

ASN_GLYCOSYLATION
69-72; MYRISTYL 
258-263; MYRISTYL 
471-476; 

ASN_GLYCOSYLATION
37-40; 

PKC_PHOSPHO_SITE
250-252; 

CK2_PHOSPHO_SITE
204-207; MYRISTYL 
435-440; 

CK2_PHOSPHO_SITE 
85-88; MYRISTYL 

465-470;

CK2_PHOSPHO_SITE 
192-195; 

PKC_PHOSPHO_SITE

466-468; MYRISTYL
118-123; MYRISTYL 
439-444; 

CK2_PHOSPHO_SITE 
149-152; 


RRM 332-405; RRM 
331-409; NTF2 11- 
133; GLY_RICH 419- 
479; NTF2_DOMAIN
11-133; rrm 333- 
404; 

sp_Q9UPA1_Q9UPA1_HUMAN 11-132;
GLU_RICH 134-223;


DEX0449_008.orf.1

N

0 - o1-392;


4-12, 1.235; 
264- 

271,1.095; 
283- 

304,1.127; 
235-
251,1.104; 
86-93,1.113; 
37-45,1.123; 
140- 

165,1.098;


CK2_PHOSPHO_SITE 
59-62; RGD 221- 
223; MYRISTYL 
168-173; 

CK2_PHOSPHO_SITE 
114-117; 

CK2_PHOSPHO_SITE 
102-105; MYRISTYL 
345-350; 

PKC_PHOSPHO_SITE 
376-378;


RRM 242-315; 
HUDSXLRNA 246-261; 
HUDSXLRNA 261-273; 
rrm 243-314; 
PRICHEXTENSN 101- 
122; PRICHEXTENSN 
194-219; GLU_RICH
52-133; 

PRICHEXTENSN 164- 
181; GLY_RICH 329- 
389; RRM 241-319;
16-33,1.144;


PKC_PHOSPHO_SITE


PRICHEXTENSN 136-








167- 


160-162; MYRISTYL 


152; PRO_RICH 138-








201,1.142; 


375-380;
ASN_GLYCOSYLATION 
227-230; MYRISTYL 
349-354; 

CK2_PHOSPHO_SITE 
51-54; MYRISTYL 
381-386; 


219; 








159- 


CK2_PHOSPHO_SITE 










175,1.104; 


71-74; 










233- 


CK2_PHOSPHO_SITE 










239,1.056; 


16-19; MYRISTYL 










94- 


269-274; 










107,1.098; 


ASN_GLYCOSYLATION


RRM 166-239; rrm 








207- 


151-154; MYRISTYL 


167-238; HUDSXLRNA 


DEX0449_009.aa.1

0 - o1-316;


228,1.127;


305-310; 


170-185; HUDSXLRNA 


N 


109- 

128,1.075; 
30-37,1.065; 
5-15,1.225; 
188- 

195,1.095; 

41-50,1.113; 

135- 

140,0.986;


PKC_PHOSPHO_SITE
300-302; 

CK2_PHOSPHO_SITE 
14-17; MYRISTYL 
299-304; MYRISTYL 
273-278; RGD 145- 
147; 

CK2_PHOSPHO_SITE
59-62; 


185-197; GLU_RICH 
17-90; RRM 165-243; 
PRO_RICH 95-143; 
GLY_RICH 253-313; 










MYRISTYL 267-272; 












MYRISTYL 271-276; 












ASN_GLYCOSYLATION












149-152; MYRISTYL 












2-7 ; 












ASN_GLYCOSYLATION
4-7; RGD 143-145;
PKC_PHOSPHO_SITE
298-300;

CK2_PHOSPHO_SITE
57-60; MYRISTYL 
297-302; 

CK2_PHOSPHO_SITE 


HUDSXLRNA 183-195; 
GLU_RICH 15-88;
PRO_RICH 93-141; 


DEX0449_009.orf.1

N

0 - o1-314;




RRM 164-237; rrm 
165-236; GLY_RICH 
251-311; HUDSXLRNA 










168-183; RRM 163- 
241 ; 










14-17; MYRISTYL 










303-308; 












CK2_PHOSPHO_SITE 












12-15; 












CK2_PHOSPHO_SITE












69-72; 












MYRISTYL 126-131; 










41-48,1.095; 


MYRISTYL 122-127; 


RRM 19-92; 








125- 


MYRISTYL 152-157; 


HUDSXLRNA 23-38;


DEX0449_009.aa.2

N

0 - i1-169;


130,1.013; 
11-28,1.104; 
86-92,1.056; 
55-82,1.127; 


MYRISTYL 158-163; 
ASN_GLYCOSYLATION
4-7; 

PKC_PHOSPHO_SITE 
153-155; 


GLY_RICH 106-166; 
RRM 18-96; 
HUDSXLRNA 38-50; 
rrm 20-91; 








85-99,1.144; 


PKC_PHOSPHO_SITE 




DEX0449_010.aa.1

0 - i1-147;


9-23,1.113; 


43-45; 


Ribosomal_S17e 1- 


N 


66-73, 1.073; 


PKC_PHOSPHO_SITE 


122; RIBOSOMAL_S17E




51-59, 1.107; 
106- 


8-10; MYRISTYL 
112-117; 


41-56; 
DEX0449_010.orf.1

0 - o1-166;



112,1.085; 
114- 

134, 1.13; 
31-44, 1.124; 



PKC_PHOSPHO_SITE
70-72; 

TYR_PHOSPHO_SITE 
14-21; 

PKC_PHOSPHO_SITE 
30-32; 

CK2_PHOSPHO_SITE
89-92; 

PKC_PHOSPHO_SITE 
142-144; 



PKC_PHOSPHO_SITE 
27-29; 

PKC_PHOSPHO_SITE 
89-91; 

PKC_PHOSPHO_SITE 
62-64; 

PKC_PHOSPHO_SITE
161-163; 

TYR_PHOSPHO_SITE
33-40; 

PKC_PHOSPHO_SITE 
49-51; 

CK2_PHOSPHO_SITE 
108-111; MYRISTYL 

131-136;

PKC_PHOSPHO_SITE 

132-134;



Ribosomal_S17e 20- 
141; RIBOSOMAL_S17E 
60-75; 



DEX0449_011.aa.1

N

0 - o1-254;



90- 

105,1.105; 
221- 

232, 1.115; 
213- 

219,1.125; 

15-32,1.096; 

160- 

173,1.13; 
239- 

247,1.187; 
77-87,1.09; 



CK2_PHOSPHO_SITE
209-212; 

CAMP_PHOSPHO_SITE
7-10; 

CK2_PHOSPHO_SITE 
35-38; 

CK2_PHOSPHO_SITE 
233-236; 

ASN_GLYCOSYLATION 
192-195; 

PKC_PHOSPHO_SITE 
35-37; 

ASN_GLYCOSYLATION
223-226; 

CK2_PHOSPHO_SITE
11-14; 

PKC_PHOSPHO_SITE
67-69; 

CK2_PHOSPHO_SITE
107-110; MYRISTYL

64-69;

CK2_PHOSPHO_SITE
10-13; 

PKC_PHOSPHO_SITE
25-27; 

CK2_PHOSPHO_SITE 
123-126; 

PKC_PHOSPHO_SITE
92-94; 

ASN_GLYCOSYLATION

65-68;

TYR_PHOSPHO_SITE



TFIIS 216-251; 
ZnF_C2C2 214-253; 
TFS2M 91-192; TFSII 
1-254; TFIIS 214- 
252; 
DEX0449_011.orf.1

0 - o1-302;



134-140; 

CK2_PHOSPHO_SITE 
177-180; 

CK2_PHOSPHO_SITE 
88-91; MYRISTYL 
207-212; 



208- 

221, 1.13; 
261- 

267,1.125; 

4-20,1.264; 

125- 

134,1.09; 
138- 

148,1.105;
63-76, 1.096; 
25-32, 1.092; 
269- 

280, 1.115; 
38-45, 1.077; 
287- 

294, 1.187 ; 



CAMP_PHOSPHO_SITE
55-58;
ASN_GLYCOSYLATION
271-274; 

CK2_PHOSPHO_SITE 
59-62; 

PKC_PHOSPHO_SITE 
140-142; 

CK2_PHOSPHO_SITE 
83-86; 

TYR_PHOSPHO_SITE 
182-188;

CK2_PHOSPHO_SITE 
281-284; 

CK2_PHOSPHO_SITE 
58-61; MYRISTYL 
255-260; 

CK2_PHOSPHO_SITE 
136-139; 

ASN_GLYCOSYLATION 
240-243; 

CK2_PHOSPHO_SITE 
171-174; 
PKC_PHOSPHO_SITE 
44-46; 

CK2_PHOSPHO_SITE 
225-228; 

PKC_PHOSPHO_SITE
115-117; 

CK2_PHOSPHO_SITE 
257-260; 

PKC_PHOSPHO_SITE 
83-85; MYRISTYL 

112-117;

ASN_GLYCOSYLATION

113-116;

PKC_PHOSPHO_SITE 
73-75; MYRISTYL 
48-53; 

CK2_PHOSPHO_SITE
155-158; 



TFSII 2-302; TFIIS 
264-299; TFS2M 139- 
240; TFS2N 12-80; 
ZnF_C2C2 262-301; 
TFIIS 262-300; 



DEX0449_012.aa.1

N

0 - o1-225;



92-98, 1.091; 
159- 

168,1.118; 
34-43, 1.152; 
132- 

138,1.052; 
113- 

125,1.136; 

63-71,1.115; 

4-26,1.152; 

73-89,1.107; 

100- 



MYRISTYL 110-115;
CK2_PHOSPHO_SITE
114-117;

CK2_PHOSPHO_SITE
3-6;

CK2_PHOSPHO_SITE
182-185; 
PKC_PHOSPHO_SITE 
63-65; MYRISTYL 
39-44; 

PKC_PHOSPHO_SITE
48-50; 



HAD-SF-IB 16-190; 
serB 1-220;
Hydrolase 14-200; 
110,1.184;
186- 

195,1.089; 

54-61,1.121; 

213- 

222,1.101; 



DEX0449_013.aa.1

N

0 - o1-249;



33-43,1.16; 
210- 

218,1.114; 
198- 

208,1.067; 
133- 

155, 1.217; 
163- 

191,1.174; 
74-80, 1.077; 
113- 

123,1.179; 
232- 

246,1.117; 
83- 

106,1.227; 
54-71,1.119; 
21-28,1.053; 
125- 

131,1.082; 



PKC_PHOSPHO_SITE
16-18; 

CK2_PHOSPHO_SITE
216-219; MYRISTYL 
41-46; 

PKC_PHOSPHO_SITE
204-206; MYRISTYL 
212-217; 

PKC_PHOSPHO_SITE
15-17; MYRISTYL 
33-38; 

CK2_PHOSPHO_SITE
47-50; 

TYR_PHOSPHO_SITE 
61-68; 

CK2_PHOSPHO_SITE 
162-165; MYRISTYL 
138-143; 



DEX0449_014.aa.1

0 - o1-113;



31-45, 1.147; 
57-80,1.147; 



PKC_PHOSPHO_SITE 
23-25; 

CK2_PHOSPHO_SITE 
107-110; 

PKC_PHOSPHO_SITE 
17-19; 

CK2_PHOSPHO_SITE
98-101;

PKC_PHOSPHO_SITE
8-10; 

CK2_PHOSPHO_SITE 
54-57; 

ASN_GLYCOSYLATION 
52-55; 

CK2_PHOSPHO_SITE
17-20;

ASN_GLYCOSYLATION
14-17; 



RCC1_2 66-76; 



DEX0449_015.aa.1

N

0 - o1-102;



83-89,1.072;
17-38,1.144;
43-51,1.06;
65-78,1.148;
91-99,1.076;



PKC_PHOSPHO_SITE 
59-61; 



DEX0449_016.aa.1

0 - o1-412;



362-391,1.147;
263-294,1.164;
297-318,1.186;
351-359,1.147;
329-



MYRISTYL 185-190; 
MYRISTYL 226-231; 
CK2_PHOSPHO_SITE
134-137; 

PKC_PHOSPHO_SITE
398-400; MYRISTYL 
38-43; MYRISTYL 
383-388; 

CK2_PHOSPHO_SITE



CYTOCHROME_P450
348-357; 

PRICHEXTENSN 46-67; 
EP450I 270-288; 
P450 355-366; 
PRICHEXTENSN 204- 
229; EP450I 345- 
355; EP450I 173- 
190; P450 271-282;
DEX0449_016.orf.1

N

0 - o1-203;



36-45,1.08; 
194- 

200,1.179; 

22-34,1.085; 

88- 

109,1.186; 

8-14,1.041; 

153- 

182,1.147; 
142- 

150,1.147; 
120- 

140,1.091; 
54-85, 1.164; 



349,1.091; 


290-293; 


403- 


AMIDATION 350- 


409,1.179; 


353; 


44-70,1.113; 


CK2_PHOSPHO_SITE


245- 


151-154; 


254,1.08; 


PKC_PHOSPHO_SITE 


191- 


257-259; MYRISTYL 


201, 1.112; 


42-47; MYRISTYL 


233- 


222-227; 


243, 1.061; 


PKC_PHOSPHO_SITE


84- 


2-4; 


125, 1.211; 




171- 




182,1.135; 




23-36, 1.193; 




5-21,1.155; 




127- 




159,1.183; 





EP450I 193-219; 

P450 184-201; p450 
-408; EP450I 355- 
378; EP450I 310-
334; P450 346-355;



PKC_PHOSPHO_SITE 
48-50; 

PKC_PHOSPHO_SITE 
15-17; AMIDATION 
141-144; 

CK2_PHOSPHO_SITE 
81-84; 

CK2_PHOSPHO_SITE 
15-18; MYRISTYL
174-179; 

PKC_PHOSPHO_SITE 
189-191; 



EP450I 18-44; P450 
27-40; BP450 146- 
157; P450 146-157; 
EP450I 61-79; BP450 
137-146; P450 62- 
73; CYTOCHROME_P450
139-148; EP450IV
57-73; EP450IV 146-
164; EP450I 146- 
169; EP450IV 130- 
146; EP450I 136- 
146; EP450I 101- 
125; BP450 79-106; 
EP450IV 106-124; 
P450 137-146; BP450 
62-73; 



DEX0449_017.aa.1

i1-116;



74-87,1.16; 
99- 

108,1.095; 

4-31,1.198; 

48-71,1.145; 



ASN_GLYCOSYLATION
94-97;
ASN_GLYCOSYLATION
64-67; 

PKC_PHOSPHO_SITE 
98-100; 



DEX0449_017.orf.1

N

0 - o1-141;



126- 

138,1.16; 

25-30,1.033; 

100- 

123,1.145;

38-44,1.04; 

73-80,1.087; 

87-93,1.083; 

11-21,1.245; 

58-68,1.175; 



PKC_PHOSPHO_SITE 
43-45; RGD 58-60; 
ASN_GLYCOSYLATION 
116-119; 

AMIDATION 21-24; 
PKC_PHOSPHO_SITE 
31-33; MYRISTYL
92-97; MYRISTYL
25-30; MYRISTYL
21-26; 



DEX0449_018.aa.1

N

0 - o1-426;



5-43,1.154; 
106- 

116,1.117; 

90-95,1.044;

356- 

363,1.107; 
224- 

231,1.101;



CAMP_PHOSPHO_SITE 
329-332; 

CK2_PHOSPHO_SITE 
377-380; 

PKC_PHOSPHO_SITE
229-231; 

PKC_PHOSPHO_SITE 
328-330;



ZINC_FINGER_C2H2_1
74-96; 
322-


CK2_PHOSPHO_SITE










328,1.077;


135-138; 










70-80,1.145;


CK2_PHOSPHO_SITE










293- 


75-78; 










299,1.08;


TYR_PHOSPHO_SITE










205- 


264-272; 










221, 1.197; 


CK2_PHOSPHO_SITE 










172- 


120-123; 










187, 1.186; 


CK2_PHOSPHO_SITE




* 






192- 


137-140; 










199, 1.115; 


PKC_PHOSPHO_SITE 










271- 


368-370; 










282,1.08;


CAMP_PHOSPHO_SITE










154- 


241-244; 










169,1.132;


CK2_PHOSPHO_SITE 










401- 


128-131; MYRISTYL 










418,1.128;


129-134; RGD 316- 










59-65,1.037;


318 ; 










45-50,1.049;


PKC_PHOSPHO_SITE 










333 - 


69-71 ; 










351,1.138;


CK2_PHOSPHO_SITE












111-114 ; 












PKC_PHOSPHO_SITE












301-303;












PKC_PHOSPHO_SITE












398-400; MYRISTYL 












243-248 ; 












CK2_PHOSPHO_SITE












332-335; 












PKC_PHOSPHO_SITE 












237-239; MYRISTYL 












247-252; 












CAMP_PHOSPHO_SITE












374-377; 












CK2_PHOSPHO_SITE












15-18; 












CK2_PHOSPHO_SITE












368-371; 










237- 


rpVO TiUr\ O OTIA O T r PT3* 

TiR iMiUb FxiU o 11a 










243 , 1 . Uo ; 


«uo aIo ; 










215- 


CK2_PHOSPHO_SITE










226,1.08;


276-279; 










345- 


CK2_PHOSPHO_SITE










362,1.128;


72-75; 










168- 


CK2_PHOSPHO_SITE










175,1.101;


64-67 ; 










266- 


PKC_PHOSPHO_SITE








0 - ol- 


272,1.077;


342-344 ; 




DEX0449_01 


N 


98- 


CAMP_PHOSPHO_SITE




8.aa.4


370; 


113,1.132; 


185-188; 










116- 


CK2_PHOSPHO_SITE










131,1.186; 


79-82; 










50-60,1.117; 


PKC_PHOSPHO_SITE 










5-39,1.125; 


181-183; 










149- 


CK2_PHOSPHO_SITE 










165,1.197; 


55-58; 










136- 


PKC_PHOSPHO_SITE










143,1.115; 


312-314; 










300- 


PKC_PHOSPHO_SITE
307,1.107;


30-32; 










277- 


CK2_PHOSPHO_SITE










295,1.138;


81-84; MYRISTYL 
73-78; 

CK2_PHOSPHO_SITE
30-33; MYRISTYL 
191-196; 

PKC_PHOSPHO_SITE 
245-247; 

CK2_PHOSPHO_SITE
321-324;

CAMP_PHOSPHO_SITE
273-276; 

CK2_PHOSPHO_SITE 
312-315; 

272-274; RGD 260- 
262 ; 

CK2_PHOSPHO_SITE
15-18; MYRISTYL
187-192;

PKC_PHOSPHO_SITE
173-175;

CAMP_PHOSPHO_SITE 
318-321; 












CK2_PHOSPHO_SITE 












111-114; 












CK2_PHOSPHO_SITE 










70-80,1.145; 


120-123; 










45-50,1.049; 


PKC_PHOSPHO_SITE










59-65,1.037; 


69-71; 










90-95, 1.044; 


CK2_PHOSPHO_SITE 




DEX0449_018.aa.5

0 - o1-208;

i 


169,1.132;
172-205,1.186;
106-116,1.117;
5-43,1.154;


135-138; MYRISTYL
129-134 ; 

CK2_PHOSPHO_SITE
75-78;

CK2_PHOSPHO_SITE
128-131;

CK2_PHOSPHO_SITE
15-18;

CK2_PHOSPHO_SITE
137-140; 


ZINC_FINGER_C2H2_1 
74-96; 








115- 












131,1.129; 


MYRISTYL 98-103; 










IT "3 Q 1 1 C C « 

17~j5 , 1 . 15o ; 


nv") nun O THJ/"\ PTfnn 


anx 1/9-203; 








57-80,1.144;


50-53 ; 


ANK_REPEAT 179-209;








192-203,1.2; 


CK2_PHOSPHO_SITE


ANKYRIN 192-204; 


DEX0449_01 


N 


0 - ol- 


181- 


107-110; 


ANK_REP_REGION 179- 


8.orf.7


209; 


189, 1.123 ; 
100- 

106,1.067; 
140- 

178,1.174; 
4-13,1.164; 


CK2_PHOSPHO_SITE 
162-165; MYRISTYL 
203-208; 

CK2_PHOSPHO_SITE
48-51; 


209; ANKYRIN 180- 
192; ANK 138-168; 
ANK 179-208; ank 
138-171; 








70-80,1.145; 


CK2_PHOSPHO_SITE 




DEX0449_018.aa.7

N

0 - o1-125;


90-95,1.044; 
45-50,1.049; 
5-43,1.154; 
59-65,1.037;


121-124; 

CK2_PHOSPHO_SITE
111-114;

CK2_PHOSPHO_SITE


ZINC_FINGER_C2H2_1
74-96;
106-

116,1.117; 



15-18; 

CK2_PHOSPHO_SITE 
120-123; 

PKC_PHOSPHO_SITE
69-71; 

CK2_PHOSPHO_SITE 
75-78; 



DEX0449_019.orf.1

N

o1-191;



45-58,1.139; 
90- 

111,1.126; 

63-86,1.209; 

178- 

184,1.058; 
115- 

135,1.185; 

10-37,1.175; 

140- 

154,1.113; 



PKC_PHOSPHO_SITE
165-167; 

CK2_PHOSPHO_SITE 
38-41; 

PKC_PHOSPHO_SITE 
184-186; 

PKC_PHOSPHO_SITE
183-185; 

PKC_PHOSPHO_SITE 
57-59; 

PKC_PHOSPHO_SITE 
89-91; MYRISTYL 
15-20; MYRISTYL 
112-117; 

PKC_PHOSPHO_SITE 
84-86;



DEX0449_019.aa.1

0 - o1-130;



101- 

115,1.14; 
117- 

127,1.111; 

37-59,1.165; 

13-30,1.165; 

66-71,1.085; 

4-11,1.144; 

75-81,1.067; 



CK2_PHOSPHO_SITE 
79-82; AMIDATION
32-35;

ASN_GLYCOSYLATION
63-66; MYRISTYL 
38-43; 
ASN_GLYCOSYLATION 
75-78; MYRISTYL 
7-12; 

PKC_PHOSPHO_SITE 
59-61; MYRISTYL 
2-7; 



DEX0449_020.orf.1

N

0 - o1-340;



216- 

223,1.099; 
183- 

191,1.146; 
127- 

173,1.255; 
269- 

275,1.09; 
112- 

123,1.195; 
315- 

321,1.08; 
79- 

102,1.145; 
226- 

252,1.213; 
286- 

302,1.253; 

23-29,1.075; 

194- 

213,1.151; 
259- 

264,1.066;
326- 



CK2_PHOSPHO_SITE
223-226; MYRISTYL 
285-290; 

PKC_PHOSPHO_SITE 
267-269; 

TYR_PHOSPHO_SITE
322-329;

PKC_PHOSPHO_SITE

17-19;

CAMP_PHOSPHO_SITE

18-21; MYRISTYL
123-128;
CAMP_PHOSPHO_SITE

19-22;

CK2_PHOSPHO_SITE 
255-258; MYRISTYL 
171-176; 

PKC_PHOSPHO_SITE
21-23; 

CK2_PHOSPHO_SITE 
248-251; 



aminotran_4 26-325;
sp_Q9BTB6_Q9BTB6_HUMAN 192-296;
AA_TRANSFER_CLASS_4
212-246;
332,1.042;












62-68,1.078; 












281- 












287,1.08;
























235-












241,1.09; 












78-89, 1.195; 


CK2_PHOSPHO_SITE










192- 


189-192; 










218,1.213; 


PKC_PHOSPHO_SITE 










225- 


233-235; 










230,1.066; 


TYR_PHOSPHO_SITE 


sp_Q9BTB6_Q9BTB6_HU 


DEX0449_02 




0 - Ol- 


93- 


288-295; MYRISTYL 


MAN 158-262; 


N 


139,1.255; 


251-256; 


aminotran_4 1-291; 


0 . aa . 1 


306; 


45-68, 1.145; 


CK2_PHOSPHO_SITE 


AA_TRANSFER_CLASS_4








292- 


221-224; MYRISTYL 


178-212; 








298,1.042; 


137-142; 










149- 


CK2_PHOSPHO_SITE 










157,1.146; 


214-217; MYRISTYL 










160- 


89-94; 










179,1.151; 












182- 












189,1.099; 












252- 












268,1.253; 












224- 












247,1.145; 


MYRISTYL 46-51; 










328- 


MYRISTYL 460-465; 










336, 1.146; 


PKC_PHOSPHO_SITE 










145- 


481-483 ; 










152,1.082; 


ASN_GLYCOSYLATION










339- 


178-181; MYRISTYL 










358,1.151; 


47-52; 










120- 


CK2_PHOSPHO_SITE










131,1.063; 


393-396; MYRISTYL 










207- 


55-60; 










213,1.078; 


PKC_PHOSPHO_SITE 










371- 


135-137; MYRISTYL 










397,1.213; 


316-321; MYRISTYL 










272- 


43-48; 


aminotran_4 170- 








318, 1.255; 


PKC_PHOSPHO_SITE 


453; 


DEX0449_02 


N 


0 - ol- 


361- 


124-126; MYRISTYL 


sp_Q9BTB6_Q9BTB6_HU 


0.orf.2




485; 


368,1.099; 


77-82; 


MAN 337-401; 








176- 


PKC_PHOSPHO_SITE 


AA_TRANSFER_CLASS_4 








195,1.149; 


56-58; 


357-391; 








257- 


PKC_PHOSPHO_SITE 










268,1.195; 


450-452; MYRISTYL 










404- 


100-105; 










412,1.107; 


CK2_PHOSPHO_SITE 










464- 


450-453; 










476, 1.259; 


CK2_PHOSPHO_SITE 










104- 


368-371; MYRISTYL 










118,1.181; 


268-273; 










427- 


ASN_GLYCOSYLATION 










449,1.147; 


33-36; MYRISTYL 










84- 


76-81; 










102,1.102; 


CK2_PHOSPHO_SITE 










26-33,1.101; 


153-156; 








4-18,1.182; 
DEX0449_020.aa.2

N

0 - o1-465;



11-25, 


1.181; 




394- 






400, 1 . 


09; 




451- 






457,1. 


042; 




52-59, 


1. 082; 




164- 




ASN_GLYCOSYLATION


175, 1 . 


195; 


85-88; MYRISTYL 


319- 




7-12; 


324, 1. 


061; 


TYR_PHOSPHO_SITE 


371- 




447-454; 


389, 1 . 


133; 


CK2_PHOSPHO_SITE 


361- 




314-317; MYRISTYL 


368,1. 


14; 


175-180; 


246- 




CK2_PHOSPHO_SITE


265,1. 


151; 


275-278; 


278- 




CK2_PHOSPHO_SITE 


304,1.


213; 


60-63; 


114- 




CK2_PHOSPHO_SITE 


120,1. 


078; 


300-303; 


4-9,1. 


069; 


PKC_PHOSPHO_SITE 


179- 




31-33; MYRISTYL 


225,1. 


255; 


333-338; MYRISTYL 


235- 




323-328; MYRISTYL 


243,1. 


146; 


410-415; 


411- 




PKC_PHOSPHO_SITE 


427,1. 


253; 


42-44; MYRISTYL 


440- 




223-228; 


446,1. 


08; 


PKC_PHOSPHO_SITE 


83- 




392-394; 


102,1. 


149; 


CK2_PHOSPHO_SITE 


27-38, 


1.063; 


307-310; 


334- 






357, 1. 


22; 




268- 






275,1. 


099; 




131- 






154, 1. 


145; 





sp_Q9BTB6_Q9BTB6_HUMAN 382-421;
sp_Q9BTB6_Q9BTB6_HUMAN 244-310;
aminotran_4 77-450;
AA_TRANSFER_CLASS_4
264-298;



DEX0449_021.orf.1

0 - o1-74;



ASN_GLYCOSYLATION
7-10; MYRISTYL 8-
13;

PKC_PHOSPHO_SITE 
11-13; 

PKC_PHOSPHO_SITE
13-15;

CK2_PHOSPHO_SITE
1-4; 

CAMP_PHOSPHO_SITE 
17-20; MYRISTYL 
32-37; 
ASN_GLYCOSYLATION 

10-13;

PKC_PHOSPHO_SITE
39-41; 

PKC_PHOSPHO_SITE 
1-3; 

CK2_PHOSPHO_SITE 

11-14;

ASN_GLYCOSYLATION
DEX0449_
.aa.1



02 



1 - i1-12;tm13-32;o33-63;



DEX0449_022.aa.1



N 



50-60, 1.136; 
4-47, 1.195; 



0 - o1-313;



9-12; MYRISTYL 3- 
8; 

PKC_PHOSPHO_SITE 
12-14; 



119- 

137,1.178; 
217- 

247,1.231; 
76-83,1.102; 
24-30, 1.066; 
267- 

282,1.131; 
Be- 
lie, 1.315; 
250- 

257,1.128; 
294- 

310,1.104; 
149- 

155,1.04; 
165- 

210,1.137; 
139- 

147,1.071; 
65-71, 1.081; 
40-59, 1.204; 



PKC_PHOSPHO_SITE
12-14; MYRISTYL 
246-251; 

CK2_PHOSPHO_SITE 

10; MYRISTYL 
65-70; 

PKC_PHOSPHO_SITE 
7-9; 

CK2_PHOSPHO_SITE 
143-146; 

PKC_PHOSPHO_SITE 
118-120; 

PKC_PHOSPHO_SITE
269-271; 

CK2_PHOSPHO_SITE 
15-18; 

ASN_GLYCOSYLATION 
22-25; 

CK2_PHOSPHO_SITE 
255-258; 

CK2_PHOSPHO_SITE 
159-162; 

PKC_PHOSPHO_SITE 
147-149; 

CK2_PHOSPHO_SITE 
113-116; 

PKC_PHOSPHO_SITE 
193-195; 

CK2_PHOSPHO_SITE
147-150; 

ASN_GLYCOSYLATION 
111-114; 

CK2_PHOSPHO_SITE 
269-272; 

PKC_PHOSPHO_SITE 
88-90; MYRISTYL 
289-294; 

PKC_PHOSPHO_SITE 
292-294; 

CK2_PHOSPHO_SITE 
61-64; 



Fe_hyd_lg_C 79-299;



DEX0449_022.orf.2

0 - o1-395;



345-392,1.137;
87-118,1.108;
245-251,1.081;
256-263,1.102;
220- 



CK2_PHOSPHO_SITE 
323-326; 

PKC_PHOSPHO_SITE 
78-80; 

PKC_PHOSPHO_SITE 
298-300; 

PKC_PHOSPHO_SITE 
46-48; MYRISTYL 
110-115; MYRISTYL



Fe_hyd_lg_C 259-
395; CLOACIN 159-
180; CLOACIN 270-
289; PRO_RICH 18-
105;
DEX0449_022.aa.2



N 







I ZJ ZD \J , 1*1 X IvX O X X J-J 






1 77 - 


245-2*50 • 

J £t Zj \J g 






L** J , X • X / / 


"*K2 PHOSPHO SITE 








5 - 11 ; 






m i n7i • 


""•"{IMP PHOQPHO QTTF! 






fiOO" 








oqc t TIC. 


ASN_GLYCOSYLATION






iq.cQ t inc. 


291-294 ; 






1 04B • 
o «j -J / x • v*a o , 


CK2_PHOSPHO_SITE






171- 

X # X 


241-244 ; 






XOO,X*XX, 


CK2_PHOSPHO_SITE






on/ 


146-149 ; 






ZXU,X.UOO/ 


CAMP_PHOSPHO_SITE








O U O J / 






ICQ 1 1 *7 A • 


niro PHOSPHO SITE 






cn-^K 1 flA • 


fil - ft • 






•500- 








3X / , X . X / o ; 


i?j i?o ; 






oZi? - 


r*Tf7 PWOSPHO QTTTSI 






i *a c i nA • 
33o , X . ; 


i q*7 — i on . 

XO / — 17U| 








PTCO PHOSPHO SITE 
















it J\\— rnuo Jt nj o lie 








1 Q7 - 1 QA • 

1 «7 a X J** , 








Jr XvC rfiuornu ox id 








Ofo _770 . 
zoo Z 'V, 








OTTO DHOQPWO STTT5 
v»ivz rnuijrnu oxxo 








j o3o ; i v i inioi x xj 








1A1 — lAfi ■ 

Xfi x — x*± d ; 








ftolN IJXJ X KJXJ O X XXfrl X XVJ1M 








*> n7 -one . 

Z UZ — fiU9 , 








i*j\z rnvornu oxxa 








7 q _ q a 7 • 








PKC_PHOSPHO_SITE








373-375; MYRISTYL








159-164; MYRISTYL 








173-178; 








CK2_PHOSPHO_SITE








327-330; 








PKC_PHOSPHO_SITE








327-329; 








PKC_PHOSPHO_SITE








187-189; 






i "a *a — 


\*rs~& a ti\jo e n\j ox x a 






OA 1 1 17. 

z^i / i. iz ; 


QA . Q 7 • 






1 — 


1 IK rnuornu Olla 






zu / , x • z us ; 


777-7*5 ** • 






IOC. 


P'KY"' PWOSPHO QTTR 
JriVw rnu jrfiv ox xxa 






1 OA 1 AQO , 


Z D U ZOZ; 1*1 x XV, X O X X U 






166- 


31-36; 




0 - ol- 


176,1.158; 


PKC_PHOSPHO_SITE


Fe_hyd_lg_C 1-214; 


285; 


184- 


148-150; MYRISTYL 


, Fe_hyd_SSU 220-275; 




189,1.045; 


75-80; 






46-76, 1.231; 


PKC_PHOSPHO_SITE 






4-10,1.133; 


98-100; MYRISTYL 






145- 


205-210; MYRISTYL


j 




150,1.073; 


10-15; 






79-86, 1.128; 


CK2_PHOSPHO_SITE 






222- 


35-38;
227,1.076;
36- 

110,1.092; 
244- 

253,1.118; 

38-44,1.11; 

263- 

271,1.105; 


CK2_PHOSPHO_SITE
117-120; MYRISTYL

206-211; 

CK2_PHOSPHO_SITE
98-101; 




DEX0449_02 
3 . orf . 1 


N 


0 - ol- 
170; 


133- 

143,1.109; 

50-73,1.14; 

81- 

115,1.154; 
36-48,1.177; 
12-30,1.165; 
148- 

153,1.079; 


PKC_PHOSPHO_SITE 
161-163; MYRISTYL 
143-148; 

PKC_PHOSPHO_SITE 

118- 120; 

PKC_PHOSPHO_SITE 

119- 121; MYRISTYL 

127- 132; 

CAMP_PHOSPHO_SITE
129-132; 

PKC_PHOSPHO_SITE 

128- 130; 

PKC_PHOSPHO_SITE 
50-52; 




DEX0449_02 
3 . aa . 1 


N 


0 - il- 
159; 


25-37,1.177; 
122- 

132, 1.109; 

39-62,1.14; 

137- 

142,1.079; 
70- 

104,1.154; 
4-19,1.165; 


PKC_PHOSPHO_SITE 

117- 119; 

PKC_PHOSPHO_SITE 

107- 109; 

PKC_PHOSPHO_SITE
150-152; MYRISTYL 
132-137; 

CAMP_PHOSPHO_SITE 

118- 121; 

PKC_PHOSPHO_SITE 
39-41; 

PKC_PHOSPHO_SITE 

108- 110; MYRISTYL 
116-121; 




DEX0449_02 
3 . orf . 2 


N 


0 - ol- 
170; 


81- 

115,1.154; 
133- 

143, 1.109; 

12-30,1.165; 

148- 

153,1.079; 

50-73,1.14; 

36-48,1.177; 


PKC_PHOSPHO_SITE 
161-163; 

PKC_PHOSPHO_SITE
50-52; MYRISTYL 
143-148 ; 

PKC_PHOSPHO_SITE
119-121; 

CAMP_PHOSPHO_SITE 
129-132; 

PKC_PHOSPHO_SITE 
128-130; 

PKC_PHOSPHO_SITE 
118-120; MYRISTYL
127-132; 




DEX0449_02
4 . orf . 1


N 


0 - ol- 
255; 


136- 

145,1.106;
228- 

249,1.154; 
182- 

192,1.188; 

4-20,1.125; 

204- 


PKC_PHOSPHO_SITE 
150-152; MYRISTYL 
228-233; MYRISTYL 
225-230; 

CK2_PHOSPHO_SITE
241-244; 

CK2_PHOSPHO_SITE 
250-253;


210,1.043;


PKC_PHOSPHO_SITE










213- 


24-26; MYRISTYL 










224,1.184;


234-239; 










84-90, 1.092; 


PKC_PHOSPHO_SITE 










28-78, 1.171; 


92-94; MYRISTYL 










117- 


170-175; 










123,1.077; 


PKC_PHOSPHO_SITE 










106- 


81-83; MYRISTYL 










115,1.076; 


55-60; 

CK2_PHOSPHO_SITE 
200-203; 

PKC_PHOSPHO_SITE
166-168; 

CK2_PHOSPHO_SITE 
92-95; 

CK2_PHOSPHO_SITE
24-27; MYRISTYL 
210-215; MYRISTYL 
230-235; 

ASN_GLYCOSYLATION
98-101; 












MYRISTYL 65-70; 












CK2_PHOSPHO_SITE 










35-44, 1.174; 
109- 

115, 1.09 ; 
97- 

105,1.083; 
121- 

134,1.19; 5- 
23,1.121; 
87-94, 1.123; 
52-74, 1.216; 


93-96; 

PKC_PHOSPHO_SITE 




DEX0449_02 
4 . aa . 1 


N 


0 - Ol- 
147; 


54-56; 

CK2_PHOSPHO_SITE

80-83; 

PKC_PHOSPHO_SITE 
23-25; 

CK2_PHOSPHO_SITE 
135-138; 

PKC_PHOSPHO_SITE
108-110; 

PKC_PHOSPHO_SITE 
45-47; 


CSD 62-129; CSP 64- 
129; COLDSHOCK 65- 
80; COLDSHOCK 86- 
95; 











CK2_PHOSPHO_SITE
200-203;

PKC_PHOSPHO_SITE 










213- 


24-26; 










224,1.184; 


CK2_PHOSPHO_SITE 










182- 


241-244; 










192,1.188; 


PKC_PHOSPHO_SITE 










84-90,1.092;


150-152; 










117- 


CK2_PHOSPHO_SITE 










123,1.077; 


24-27; MYRISTYL 




DEX0449_02 
4 . orf . 2




0 - ol- 
255; 


28-78, 1.171; 


225-230; MYRISTYL 




N 


136- 

145,1.106; 

4-20,1.125; 

204- 

210,1.043; 
228- 

249,1.154; 
106- 

115,1.076; 


230-235; MYRISTYL 
55-60; MYRISTYL 
234-239; MYRISTYL 
170-175; 

PKC_PHOSPHO_SITE 
92-94; 

PKC_PHOSPHO_SITE 
166-168; 

CK2_PHOSPHO_SITE 
250-253; MYRISTYL 
228-233; 

ASN_GLYCOSYLATION


98-101;

PKC_PHOSPHO_SITE
81-83;

CK2_PHOSPHO_SITE

92-95; MYRISTYL
210-215; 




DEX0449_02 
4 . orf . 3


N 


0 - Ol- 
243;



94- 

103,1.076; 
192-
198,1.043; 
170- 

180,1.188; 
216- 

237,1.154; 
16-66,1.171; 
72-78,1.092; 
201- 

212,1.184; 
124- 

133,1.106; 
105- 

111,1.077; 


MYRISTYL 9-14;

CK2_PHOSPHO_SITE

238-241; 

CK2_PHOSPHO_SITE
229-232; MYRISTYL 
216-221; 

CK2_PHOSPHO_SITE 
188-191; MYRISTYL 
222-227; MYRISTYL 
218-223; MYRISTYL 
8-13;
PKC_PHOSPHO_SITE
69-71; MYRISTYL 
213-218; 

PKC_PHOSPHO_SITE 
154-156; MYRISTYL 
43-48; 

PKC_PHOSPHO_SITE 
138-140; MYRISTYL
158-163; 

PKC_PHOSPHO_SITE 
80-82; 

ASN_GLYCOSYLATION
86-89; 

CK2_PHOSPHO_SITE 
80-83; 

PKC_PHOSPHO_SITE 
3-5; MYRISTYL 
198-203; 




DEX0449_02 
4 . aa . 3 


N 


0 - ol- 
147; 


5-23,1.121; 
52-74, 1.216; 
121- 

134,1.19; 
109- 

115,1.09; 
35-44, 1.174; 
97- 

105,1.083; 
87-94,1.123; 


PKC_PHOSPHO_SITE 
54-56; 

CK2_PHOSPHO_SITE 
80-83; 

PKC_PHOSPHO_SITE 
45-47; 

CK2_PHOSPHO_SITE 
135-138; 

PKC_PHOSPHO_SITE 
23-25; 

CK2_PHOSPHO_SITE
93-96; MYRISTYL 
65-70; 

PKC_PHOSPHO_SITE 
108-110; 


CSP 64-129; 
COLDSHOCK 65-80; 
COLDSHOCK 86-95; 
CSD 62-129; 


DEX0449_02 
4 . orf . 4


N 


0 - Ol- 
202; 


129- 

139,1.188; 
160- 

171,1.184; 

23-37,1.092; 

151- 

157,1.043; 

53-62,1.076; 

175- 


MYRISTYL 181-186; 
MYRISTYL 175-180; 
ASN_GLYCOSYLATION
45-48; 

PKC_PHOSPHO_SITE 
29-31; MYRISTYL 
172-177; 

CK2_PHOSPHO_SITE 
188-191;


DEX0449_02
aa.4 



DEX0449_02 
4 . aa . 5 



N 



N 



0 - Ol- 
138; 



196,1.154;
64-70,1.077;
11-17,1.151; 
83-92,1.106; 



108- 

119, 1.184; 
99- 

105, 1.043; 
123- 

135,1.126;

4-10,1.076; 

12-18,1.077; 

31-40,1.106; 

77-87,1.188; 



0 - Ol 
185; 



124- 

134,1.188; 

78-87,1.106; 

155- 

166,1.184; 
59-65,1.077; 
48-57,1.076; 
170- 

182,1.126; 
146- 

152,1.043; 
5-23,1.121; 



PKC_PHOSPHO_SITE
39-41; MYRISTYL 
117-122; MYRISTYL 
2-7; 

PKC_PHOSPHO_SITE 
113-115; 

CK2_PHOSPHO_SITE
39-42; 

CK2_PHOSPHO_SITE
197-200; 

CK2_PHOSPHO_SITE 
147-150; MYRISTYL 
157-162; 

PKC_PHOSPHO_SITE 
97-99; MYRISTYL 
177-182; 



CK2_PHOSPHO_SITE 
95-98; MYRISTYL 
105-110; 

PKC_PHOSPHO_SITE 
45-47; MYRISTYL 
123-128; MYRISTYL 
120-125; 

PKC_PHOSPHO_SITE 
61-63; MYRISTYL 
65-70; 



CK2_PHOSPHO_SITE 
142-145; MYRISTYL 
170-175; 

CK2_PHOSPHO_SITE 
34-37; 

PKC_PHOSPHO_SITE 
92-94; 

PKC_PHOSPHO_SITE 
34-36; MYRISTYL 
152-157; 

PKC_PHOSPHO_SITE 
23-25; 

PKC_PHOSPHO_SITE
108-110; MYRISTYL 
112-117; MYRISTYL 
167-172; 

ASN_GLYCOSYLATION
40-43; 



DEX0449_02 
4 . orf . 5 



0 - ol 
265; 



214- 

220,1.043; 

4-14,1.143; 

223- 

234,1.184; 
146- 

155,1.106; 
127- 

133,1.077; 
116- 

125, 1.076 ; 
73-91, 1.121; 
192- 

202, 1.188; 
57-65,1.074;



PKC_PHOSPHO_SITE 
102-104; 

PKC_PHOSPHO_SITE
176-178; MYRISTYL 
27-32; 

PKC_PHOSPHO_SITE 
91-93; MYRISTYL 
235-240; MYRISTYL 
31-36; 

PKC_PHOSPHO_SITE 
51-53; MYRISTYL 
238-243; 

PKC_PHOSPHO_SITE
160-162;
CK2_PHOSPHO_SITE
DEX0449_02
aa . 6 



DEX0449_02
5 . aa . 1



N 



238-

259,1.154; 
18-27,1.167; 



0 - ol 
201; 



42-56, 1.099; 
33-39,1.074; 
58- 

181, 1.229; 
14-31,1.197; 



0 - Ol- 
159; 



260-263; MYRISTYL
180-185; MYRISTYL 
240-245; 

CK2_PHOSPHO_SITE
102-105; MYRISTYL
42-47; 

CK2_PHOSPHO_SITE
210-213; MYRISTYL 
220-225; 

CK2_PHOSPHO_SITE 
251-254; MYRISTYL 
244-249; 

PKC_PHOSPHO_SITE 
15-17; 

ASN_GLYCOSYLATION
108-111; 



MYRISTYL 95-100; 
PKC_PHOSPHO_SITE
30-32; 

CK2_PHOSPHO_SITE 
127-130; MYRISTYL 
143-148; 

CK2_PHOSPHO_SITE 
10-13; 

PKC_PHOSPHO_SITE 
163-165; 

PKC_PHOSPHO_SITE 
10-12; 

CK2_PHOSPHO_SITE 
74-77; MYRISTYL 
67-72; 

PKC_PHOSPHO_SITE 
181-183; 



ER TARGET 198-201; 



149- 

155,1.129; 

17-32,1.132; 

109- 

136,1.177; 
74-88,1.162; 
49-67,1.112; 
99- 

107,1.157; 
34-43,1.178; 



CK2_PHOSPHO_SITE 
138-141; 

CK2_PHOSPHO_SITE
41-44;

PKC_PHOSPHO_SITE
85-87; MYRISTYL 
144-149; 

CK2_PHOSPHO_SITE 
91-94; 

PKC_PHOSPHO_SITE
52-54; 

CK2_PHOSPHO_SITE 
108-111; 

PKC_PHOSPHO_SITE
29-31; 

CK2_PHOSPHO_SITE 
143-146; 



DEX0449_02
5 . aa . 2 



N 



1 - i1-36;tm37-54;o55-128;



35-44,1.203; 
4-10,1.103; 
118-

124,1.129; 
48-56, 1.102; 
68-76,1.157; 
78- 

105, 1.177 ; 



CK2_PHOSPHO_SITE 
77-80; 

CK2_PHOSPHO_SITE 
60-63; 

ASN_GLYCOSYLATION 
44-47; MYRISTYL 
113-118; 

CK2_PHOSPHO_SITE 
107-110;
DEX0449_02
aa. 1 



DEX0449_02 
6 . aa . 2 



0 - Ol 
224; 



63-74, 1.101; 
148- 

166,1.202; 

43-49,1.048; 

186- 

195,1.118; 
135- 

141, 1.07; 

16-28,1.137; 

124- 

129,1.03; 
114- 

120,1.096; 
170-

178,1.135;
198-
205,1.103;
90-96,1.08;
212- 

221,1.109; 



0 - ol 
230; 



PKC_PHOSPHO_SITE 
57-59; 

ASN_GLYCOSYLATION

12- 15; 

CK2_PHOSPHO_SITE 
112-115; 

PKC_PHOSPHO_SITE 

13- 15; 



CK2_PHOSPHO_SITE
119-122; 

CAMP_PHOSPHO_SITE
143-146; 

CK2_PHOSPHO_SITE 
188-191; 

ASN_GLYCOSYLATION
59-62; 

PKC_PHOSPHO_SITE
77-79; 

CK2_PHOSPHO_SITE 
101-104; 

PKC_PHOSPHO_SITE 

141- 143; 

CK2_PHOSPHO_SITE 
108-111; 

PKC_PHOSPHO_SITE 
37-39; 

TYR_PHOSPHO_SITE
166-174; 

PKC_PHOSPHO_SITE 

142- 144; 



176- 

184,1.135; 
154- 

172,1.202; 
120- 

126, 1.096; 

49-55,1.048; 

218- 

227,1.109; 
204- 

211,1.103; 
141- 

147,1.07; 
69-80,1.101; 
8-34,1.204; 
130- 

135,1.03; 
192- 

201,1.118; 
96-102,1.08; 



SH2DOMAIN 113-127;
SH2 113-196; SH2 111-202; PRO_RICH 19-102; SH2 113-221;

sp_Q13094_LCP2_HUMAN 113-207;
SH2DOMAIN 185-199;
SH2DOMAIN 133-143;



TYR_PHOSPHO_SITE 
172-180; 

PKC_PHOSPHO_SITE 

148- 150; 

PKC_PHOSPHO_SITE
43-45; 

CK2_PHOSPHO_SITE
114-117; 

PKC_PHOSPHO_SITE
83-85; MYRISTYL 
8-13; 

PKC_PHOSPHO_SITE 
147-149; 

CAMP_PHOSPHO_SITE

149- 152; 

CK2_PHOSPHO_SITE 
107-110; 
CK2_PHOSPHO_SITE
194-197; 

CK2_PHOSPHO_SITE 
125-128; 

ASN_GLYCOSYLATION 
65-68; 



SH2DOMAIN 119-133;
PRO_RICH 25-108;
sp_Q13094_LCP2_HUMAN 119-213; SH2 119-227; SH2DOMAIN 191-205; SH2 117-208;
SH2 119-202;
SH2DOMAIN 139-149;



DEX0449_
6 . orf . 2



0 - ol 
225; 



91-97,1.08;
136- 

142,1.07; 
149- 

167,1.202; 
213- 



PKC_PHOSPHO_SITE
78-80; 

PKC_PHOSPHO_SITE 
38-40; 

CK2_PHOSPHO_SITE 
189-192;



SH2 114-197; SH2 114-222;
sp_Q13094_LCP2_HUMAN 114-208; PRO_RICH 20-103; SH2 112-203; SH2DOMAIN 186-
DEX0449_02
aa. 1 



N 



222, 1.109; 
187- 

196,1.118; 

64-75,1.101; 

125- 

130,1.03; 
171- 

179,1.135; 
199-

206,1.103;
44-50,1.048; 
4-29,1.249; 
115- 

121,1.096; 



0 - ol 
1085; 



CK2_PHOSPHO_SITE

102-105;
PKC_PHOSPHO_SITE 

142- 144; 

CK2_PHOSPHO_SITE 
120-123; 

ASN_GLYCOSYLATION 
60-63; 

PKC_PHOSPHO_SITE 

143- 145; 

CK2_PHOSPHO_SITE 
109-112; 

CAMP_PHOSPHO_SITE 

144- 147; 

TYR_PHOSPHO_SITE 
167-175; 



1028- 

1040,1.222; 
860- 

866,1.065; 
684- 

694,1.255; 

13-34,1.134; 

660- 

676, 1.101; 
1008- 

1020,1.107; 
176- 

185, 1.108; 
784- 

811,1.127; 
720- 

736,1.173; 
205- 

223, 1.113 ; 
981- 

994, 1.16 ; 

36-43,1.066; 

426- 

450, 1.214 
613- 

624,1.087 
957- 

973, 1.166 
585-

592,1.129 
311- 

317,1.111 
357- 

369, 1.131 
286- 

309,1.102; 
704- 

713,1.132; 
117- 

129,1.172; 
884- 

894, 1.151; 
747- 



200; SH2DOMAIN 114-128; SH2DOMAIN 134-144;



CK2_PHOSPHO_SITE 
80-83; MYRISTYL 
99-104; MYRISTYL 
98-103; 

CK2_PHOSPHO_SITE 
181-184; MYRISTYL 
335-340; MYRISTYL 

95- 100; MYRISTYL 
512-517; MYRISTYL 

96- 101; MYRISTYL 
683-688; 

CK2_PHOSPHO_SITE 
116-119; MYRISTYL 
42-47; 

CK2_PHOSPHO_SITE
580-583; MYRISTYL 
889-894; 

ASN_GLYCOSYLATION
766-769; 

TYR_PHOSPHO_SITE 
323-330; MYRISTYL 
656-661; 

CK2_PHOSPHO_SITE 
299-302; 

PKC_PHOSPHO_SITE 
390-392; MYRISTYL 

679- 684; MYRISTYL 
12-17; MYRISTYL 
738-743; 

CAMP_PHOSPHO_SITE
236-239;
ASN_GLYCOSYLATION
456-459; 

PKC_PHOSPHO_SITE 

680- 682; MYRISTYL 
63-68; MYRISTYL 
76-81; 
ASN_GLYCOSYLATION 
783-786; 

PKC_PHOSPHO_SITE
939-941; MYRISTYL
93-98;
PKC_PHOSPHO_SITE



LON 123-368; 
LON_SER 978-986;
ENDOLAPTASE 1005-
1024; GLY_RICH 56-
106; LON 123-368; 
AAA 515-786; 
ENDOLAPTASE 1028- 
1046; ATP_GTP_A 
523-530; Ion 125- 
1073; ENDOLAPTASE 
975-994; 

DISEASERSIST 579- 
593; DISEASERSIST 
518-533; 

ENDOLAPTASE 523- 
542; ENDOLAPTASE 
888-904; AAA 518- 
839;
766,1.195
251- 

257,1.062 
639- 

648,1.073 

16- 
530,1.143 
131- 

144,1.103 
602-

608,1.102;
264-

283,1.144;
815- 

833,1.167; 
738- 

745,1.081; 
1056- 

1072, 1.154; 
941- 

954,1.059; 
925- 

934,1.076; 
321- 

333,1.112; 
480- 

489,1.102; 
869- 

880, 1.117; 
146- 

169, 1.172 
533- 

539,1.047 
499- 

510,1.138 
1046- 

1054,1.093; 
571- 

579,1.12; 
462- 

468, 1.049; 

45-53,1.059; 

835- 

858, 1.106; 
563- 

569, 1.04; 
372- 

378, 1.057; 
903- 

909,1.107; 
106- 

112, 1.104 ; 
392- 

406,1.113; 
5-11,1.071;



1012-1014; 
MYRISTYL 634-639; 
PKC_PHOSPHO_SITE
975-977; MYRISTYL 
89-94; 

CK2_PHOSPHO_SITE 
975-978; 
AMIDATION 241-
244; 

PKC_PHOSPHO_SITE 
72-74; 

CK2_PHOSPHO_SITE 
339-342; 

ASN_GLYCOSYLATION 
450-453 ; 

PKC_PHOSPHO_SITE
299-301; 

CK2_PHOSPHO_SITE 
770-773; MYRISTYL 
893-898; MYRISTYL 
675-680; 

PKC_PHOSPHO_SITE 
251-253; MYRISTYL

734- 739; MYRISTYL 
744-749; MYRISTYL 

735- 740; 

PKC_PHOSPHO_SITE 
239-241; MYRISTYL 
654-659; MYRISTYL 
77-82; MYRISTYL 
81-86; MYRISTYL 
809-814; MYRISTYL 
526-531; 

ASN_GLYCOSYLATION
174-177; 

PKC_PHOSPHO_SITE 
909-911; MYRISTYL 
446-451; MYRISTYL 
637-642; MYRISTYL 
567-572; 



DEX0449_02 
7 . orf . 1



N 



0 - ol 
668; 



444- 

458,1.113; 
615- 

621,1.04;



CK2_PHOSPHO_SITE 
132-135; MYRISTYL 
387-392; MYRISTYL 
145-150; 



GLY_RICH 108-158; 
DISEASERSIST 570- 
585; DISEASERSIST 
631-645; ATP_GTP_A
41-48,1.124;


AMIDATION 293-


575-582; AAA 567- 








183 - '* 


296;


565; LON 175-420; 








196,1.103;


ASN_GLYCOSYLATION


LON 175-420;








532- 


226-229; MYRISTYL










541,1.102;


39-44; MYRISTYL










585- 


141-146 ; 










591,1.047;


PKC_PHOSPHO_SITE










478- 


303-305;












CK2_PHOSPHO_SITE










623 - 


632-635 ; 










cqi i 12 • 


CK2_PHOSPHO_SITE










363 - 


233-236; MYRISTYL










769 1 111; 

~J \J J I X. m X X X / 


133-138 ; 










424 - 


CK2_PHOSPHO_SITE










430,1.057;


391-394 ; 










88-95,1.066;


CK2_PHOSPHO_SITE










257-


168-171; MYRISTYL 










275,1.113;


128-133; MYRISTYL










1 6 Q - 


147-152; MYRISTYL










181,1.172;

X O X / X « X t A f 


150-155 ; 










551 - 


PKC_PHOSPHO_SITE










E62 1 13 8 • 

; X . X J O , 


291-293 ; 












CK2_PHOSPHO_SITE












53-26- MYRISTYL 

A JAW , 11 X fVX X X XI 










1 0-16 1 06 • 

X \J X D / X. v/ O / 


148-153 ; 










57-63,1.071;


PKC_PHOSPHO_SITE










51 4 - 


12-14 ; 










con 1 049 • 


CK2_PHOSPHO_SITE










3 16- 


38-41; MYRISTYL










77 5 1 1 44 • 

JJ J ( X . X*x *3 , 


64-69 ; 










65-86,1.134;


PKC_PHOSPHO_SITE










1 CO. 


124-126; MYRISTYL










164,1.104;


578-583 ; 










3 0-36 1 OR5 • 

o\t JO, 1 . U D J / 


ASN_GLYCOSYLATION










A « O 


508-511 • 

-J V O J X X , 










237 1 1 Oft • 

^ J / / X . X W O / 


PKC_PHOSPHO_SITE










637- 


442-444; MYRISTYL










644 1 1 9Q • 


115-120 ; 










654 - 


CAMP_PHOSPHO_SITE










660,1.102;


288-291; MYRISTYL 










409- 


564-569; MYRISTYL 










421,1.131;

*± X , X • X J X | 


498-503; MYRISTYL










303- 


619-624; 










309,1.062; 


CK2_PHOSPHO_SITE 










338- 


351-354; 










361, 1.102; 


PKC_PHOSPHO_SITE










568- 


351-353; 










582,1.143; 


CAMP_PHOSPHO_SITE 










373 - 


8-11; MYRISTYL 










385,1.112; 


151-156; MYRISTYL 










97- 


129-134; 










105,1.059; 


TYR_PHOSPHO_SITE
375-382; 

ASN_GLYCOSYLATION
502-505; 

PKC_PHOSPHO_SITE 
6-8; MYRISTYL 94- 
99; 




DEX0449_02


N 


0 - o1-


345-


TYR_PHOSPHO_SITE


Ion 3-759; AAA 330-
7 . aa . 2




1 

771 ; 1 


1 r 1 ^ n A *7 • 

351 , X . U4 / ; 


1 "iC - 1 A O • 

Ijd- 14 z ; 


c O C . RMDOIjAPTASE 








4/0- 


A cm PT.VPn Q VT .TXTTOW 


"574-590' AAA 327- 








4 y / , i • xz / ; 




472 • TjON SER 664- 








£ JO" 


CA-CQ. MYPTQTYT. 
3 *x O 17 ; I'll tvl Ol ill 


672* DTSEASERSIST 








TO I O 1 A • 


3 / JJ — 3 O *± , ruixliJ 1 1 u 


391-405- LON 2-180* 










C7C -tron . 

3 / 3 3 O W, 


DISEASERSIST 330- 








ZlO , 1 . ilJ / 


r»v"o punQPwrj STTR 


34 5 • ENDOLjAPTASE 








C 1 1 - 
Oil 


CC1 -CCA . 

OOJ. OO** , 


335-354 • 








con 1 H7<C • 
bzu, i . u ? o ; 


DVP PT-inQPTIO QTTT3 


KT<rDOTiAPTASE 714- 








DUl" 


(Too _ 7n n • 

O 270 — / *JU, 


732 • ENDOLAPTASE 








X • lb / ; 


7\C\T flT von CVT ,BTT flM 


6 91-710* LON 5-180 • 










A CO - A CC • 
43 a — *4 33 , 


PTiPPPOTEASEA 441- 








1 AC 1 1 1 O • 
1*1 3 / J. . 11Z , 


v»x\w£ rnuorxiw oxxizj 


4 55- ATP GTP A 3 3 5 - 








3 / vj - 


1 CI -1 CA • 

13 1 — 13*4 , 


34 2 • CIiPPPOTEASEA 








COn 1 1 C 1 • 

joU/ x • idi ; 


zvcvr nT.vr'nQVT.ZLT'TO'Nr 


J JX J4 J / 








b43 - 


6QO £ / x , 


RNDOIiAPTASE 661- 








OJJ; 1 ■ loo ; 


HOlM *v7Xj X tUD IUnX X win 


o o w , 








Ol A — 
/ 14 


oco -occ - 










OOC 1 •)*)*) . 












C CO - 


111-11A. 
xii i x*± ; 










con i i c • 
boU, 1 . lb; 


L. JN-Z _rnUo trxlVJ o Ills 










CO O 


ACC-ACO. MVP T CTYT. 
430-45!?; jruKlolIU 










^ x n 1 ACQ. 

64U, 1 .Uoy ; 


OCQ — OCO . 

zoo AbJ ; 










521- 


ntfft PTJ^CPWri QTTP 
c J\.L- fUL/o JrxlVJ o li.fi 










r ^ / T "inc. 

544 , X . XUb ; 


CCI -CCI . 

bbi-bbJ ; 










•"7 VI O 

742- 


nvp -OTTO CI "DUO CTTT7 

■fcrlvL, irxlvJo IrrHJ o li.fi 










758 , X . 154 ; 


111—11*5. MVPTCTVT. 
111-11.3; lYJlKlolXXj 










coo 

5oy - 


C Q — OO . 

bo- / 3 ; 










c q c i i n o • 

5 y 5 , x . iu / ; 


Jr r*X+ irtlKJairtlXJ o lifi 










oo a 1 1 no , 
/y-yi , i. U02/ 


o no - o n A . 










O 1 o 


IrJxv^ rnUDrnu ol X fi 










/40 / 1 . uy j ; 


CQC — CQO . 

373 -oy / ; 










o - X9 , X . Qol7 ; 


/""VO DUftCBlin QTTP 










ICO 

169- 


OOO— OOC * MYDTCPVT. 

jyA - jyb ; riiKiDiiii 










1 O *l 1 111 . 

XoX , X . 1 J X ; 


i no_i no . MVDTCTVT. 

1UA-1U / ; rdiKlol Xii 










1 o o 
X2 J - 


3 /y-Jo^; 










TOO 1 111. 

i^y ,1.111; 


xtjW- rnuornu oxxfi 










1 Q A 
Xo4 - 


COC-COO • MVP T QTVT . 

oaj'Oa / ; riiKxo xxxj 










i on i Hco . 


7*jo_ OAO • MVP T QTVT, 
j JO - j 1 j , PIXaXO X XXj 










■JOT. 

jy / - 


3OA-30Q. MVP T QTVT. 










ArtA 1 lOQ . 


1 A7- 1 CO • 










*i 1 1 — 


PZiMP PHnQPPft OTTt? 
Oi-iTQir cn\Jk2srn.\J o x xfi 










•300 1 1 "IP, • 

3 4tS « / 1 . 1J O / 


A - 7 . MVP T QTVT . 7^- 
*± — / , ni ai o x x xj /j 








O — CO 1 1 *7 A • 
3 45 — 3 / , 1 . 1 / *3 , 


70* MVP T QTVT i AQC- 












cno - 

3 UU , 










A CO 1 1 O . 

43A , l . lo ; 












C A C 

54b- 












cco i ncc ■ 












O O Q 












342, 1.143 ; 












416- 












425,1.132; 












95- 












121,1.152; 












694- 












706,1.107; 












292- 












301,1.102; 












274-
DEX0449_
7 . aa . 3







280,1.049; 










555- 










566,1.117; 










383- 










391,1.12; 










375- 










381,1.04; 










321- 


CK2_PHOSPHO_SITE 








333,1.112;


181-184; 








205- 


ASN_GLYCOSYLATION 








223,1.113;


174-177; MYRISTYL 








499- 


810-815 ; MYRISTYL 








510,1.138;


526-531; MYRISTYL 








131-


93-98; MYRISTYL 








144,1.103;


767-772 ; MYRISTYL 








36-43,1.066;


76-81; 








286- 


CK2_PHOSPHO_SITE








309,1.102;


116-119; 








533- 


ASN_GLYCOSYLATION








539,1.047;


657-660; MYRISTYL 








585- 


512-517 ; 








592,1.129;


AMIDATION 241-








146- 


244 ; 








169,1.172;


CK2_PHOSPHO_SITE








426- 


80-83;








450,1.214;


ASN_GLYCOSYLATION


Ion 125-842; AAA 






117 - 


640-643; MYRISTYL 


515-660 ; 






129,1.172;


89-94 ; 


DISEASERSIST 579- 






813 - 


CAMP_PHOSPHO_SITE


593; LON 123-368; 






819,1.153;


236-239; MYRISTYL 


CLPPROTEASEA 629- 






176- 


446-451; MYRISTYL 


643; LIPOCALIN 803- 






185,1.108;


763-768; MYRISTYL 


815; AAA 518-713;


2 

2 NT 


0 - ol- 


480-


98-103; MYRISTYL 


ENDOLAPTASE 523-


848 ; 


489,1.102;


567-572; 


542; ENDOLAPTASE 






658- 


PKC_PHOSPHO_SITE


762-778; 






685,1.127;


239-241; MYRISTYL 


CLPPROTEASEA 519- 






839- 


824-829 ; 


537; ATP_GTP_A 523- 






845,1.126;


CK2_PHOSPHO_SITE


530; GLY_RICH 56-






743 - 


299-302; MYRISTYL 


106; DISEASERSIST 






754,1.117;


99-104; MYRISTYL 


518-533; LON 123- 






251- 


12-17; MYRISTYL 


368; 






257,1.062;


683-688; MYRISTYL 








604- 


42-47; MYRISTYL 








613,1.132;


95-100; 








734- 


ASN_GLYCOSYLATION








740,1.065;


450-453; MYRISTYL 








392- 


77-82 ; 








406,1.113;


PKC_PHOSPHO_SITE








784- 


72-74; MYRISTYL








790,1.079; 


81-86; MYRISTYL 








13-34,1.134; 


96-101; 








516- 


PKC_PHOSPHO_SITE








530, 1.143; 


299-301; MYRISTYL 


1 






264- 


63-68; 








283,1.144; 


TYR_PHOSPHO_SITE 








758- 


323-330; 








768,1.151; 


PKC_PHOSPHO_SITE








709- 


251-253; 








732,1.106;


PKC_PHOSPHO_SITE
563-

569,1.04; 
357-

369,1.131;
462- 

468,1.049; 
311- 

317,1.111; 
571- 

579,1.12; 

45-53,1.059; 

794- 

809,1.217; 
620- 

640,1.18; 
372- 

378,1.057; 
106- 

112,1.104; 
689- 

707,1.167; 
5-11,1.071; 



792-794; 

AMIDATION 778-

781; 

CK2_PHOSPHO_SITE 
580-583; 

CK2_PHOSPHO_SITE
339-342; 

ASN_GLYCOSYLATION 
456-459; 

CK2_PHOSPHO_SITE 
644-647; MYRISTYL 
335-340; 

PKC_PHOSPHO_SITE 
390-392; 



DEX0449_02
8 . aa . 1 



N 



0 - ol 
124; 



4-14,1.157;
110- 

116,1.024; 
37-45,1.061; 
48-53,1.048; 
62-79,1.175; 



MYRISTYL 115-120; 

CK2_PHOSPHO_SITE

15-18; 

TYR_PHOSPHO_SITE 
58-66; 

CK2_PHOSPHO_SITE 
99-102; 

PKC_PHOSPHO_SITE 
37-39; 

TYR_PHOSPHO_SITE 
45-53; 

ASN_GLYCOSYLATION
90-93; 

CK2_PHOSPHO_SITE 
30-33; 



DEX0449_02
9 . aa . 1



N 



1 - i1-16;tm17-34;o35-296;



11-36,1.154;
181-194,1.132;
261-267,1.144;
213-221,1.069;
95-119,1.183;
60-70,1.115;
247-254,1.114;
148-161,1.085;
125-139,1.129;
167-177,1.148;



CK2_PHOSPHO_SITE 
242-245; 

PKC_PHOSPHO_SITE 
3-5; MYRISTYL 
258-263; 

CK2_PHOSPHO_SITE 
40-43; 

PKC_PHOSPHO_SITE 
219-221; 

PKC_PHOSPHO_SITE 
146-148; MYRISTYL 
155-160; 
ASN_GLYCOSYLATION
239-242; 

PKC_PHOSPHO_SITE 
179-181; 

CK2_PHOSPHO_SITE 
195-198; MYRISTYL 
22-27; 

CK2_PHOSPHO_SITE 
179-182; 

CAMP_PHOSPHO_SITE



KH 51-120;
KH_TYPE_1_1 52-115; KH 56-103; KH 128-178; KH 123-195;
KH_TYPE_1_2 124-190;
79-82;




DEX0449_03 
0 . aa . 1 


N 


1 - i1-164;tm165-187;o188-195;


11-17,1.104; 
83- 

101,1.178; 
140- 

151,1.186; 
24-32,1.128; 
42-78,1.18; 
114- 

126,1.137; 
165- 

192, 1.253; 
105- 

112,1.023; 


PKC_PHOSPHO_SITE 
161-163; 

PKC_PHOSPHO_SITE
99-101; MYRISTYL
181-186; 

ASN_GLYCOSYLATION 
21-24; 

PKC_PHOSPHO_SITE 
68-70; MYRISTYL 
185-190; MYRISTYL 
158-163; 
AMIDATION 191- 
194; 

ASN_GLYCOSYLATION 
129-132; MYRISTYL 
171-176; 

CK2_PHOSPHO_SITE
131-134; 

ASN_GLYCOSYLATION
93-96; 


MHCCLASSI 136-154; 
MHCCLASSI 65-81; 


DEX0449_03 
0 . orf . 1


N 


1 - o1-165;tm166-188;i189-194;




ASN_GLYCOSYLATION 
128-131; 
AMIDATION 190- 
193; MYRISTYL 
170-175; 

CK2_PHOSPHO_SITE
3-6; 

PKC_PHOSPHO_SITE 
67-69; AMIDATION 
3-6; 

ASN_GLYCOSYLATION 
92-95; MYRISTYL 
157-162; 

CK2_PHOSPHO_SITE
130-133; MYRISTYL 
184-189; 

PKC_PHOSPHO_SITE
3-5; 

PKC_PHOSPHO_SITE 
98-100; 

ASN_GLYCOSYLATION 
20-23; MYRISTYL 
7-12; MYRISTYL 
180-185; 

PKC_PHOSPHO_SITE 
160-162; 

CK2_PHOSPHO_SITE 
7-10; 

ASN_GLYCOSYLATION 
6-9; 

CAMP_PHOSPHO_SITE 
5-8; MYRISTYL 2-7;


MHCCLASSI 64-80; 
MHCCLASSI 135-153; 


DEX0449_03 

1 . aa . 1 


M 


0 - ol- 
132; 


122- 

129,1.15;
10-24,1.133; 
67-74,1.074; 
106- 


MYRISTYL 22-27;
PKC_PHOSPHO_SITE 
33-35; 

ASN_GLYCOSYLATION
17-20;


117,1.25;


ASN_GLYCOSYLATION


94-


2-5; 










102,1.038;


PKC_PHOSPHO_SITE
118-120; MYRISTYL
61-66;

PKC_PHOSPHO_SITE
4-6;

CK2_PHOSPHO_SITE
4-7; AMIDATION 
103-106; MYRISTYL 
89-94; 

CAMP_PHOSPHO_SITE
76-79; 

PKC_PHOSPHO_SITE 
32-34; 

CK2_PHOSPHO_SITE 
88-91; 

ASN_GLYCOSYLATION 
27-30; 













CK2_PHOSPHO_SITE 












192-195; 












PKC_PHOSPHO_SITE 












166-168; 












PKC_PHOSPHO_SITE












3-5; 












PKC_PHOSPHO_SITE 












148-150; 












PKC_PHOSPHO_SITE 










141- 


102-104; 










146,1.033; 


PKC_PHOSPHO_SITE 










46-52,1.027; 


20-22; 










54-61,1.105; 


PKC_PHOSPHO_SITE




DEX0449_03 


N 


0 - ol- 


153- 


184-186; 


LYS_RICH 5-40; 


1 . orf . 1


199; 


163,1.116; 
171- 

192,1.152; 
94- 

101,1.084; 


PKC_PHOSPHO_SITE 
50-52; 

CK2_PHOSPHO_SITE
148-151; 

ASN_GLYCOSYLATION
167-170; 

CK2_PHOSPHO_SITE
79-82; 

CAMP_PHOSPHO_SITE 
12-15; 

ASN_GLYCOSYLATION
77-80; 

PKC_PHOSPHO_SITE
79-81; 










122- 


ASN_GLYCOSYLATION










131, 1.066; 


130-133; MYRISTYL 










136- 


98-103; MYRISTYL 


UBCc 111-255;








174,1.19; 


53-58; 


UQ_con 107-250; 








100- 


CK2_PHOSPHO_SITE 


UBIQUITIN_CONJUGAT_


DEX0449_03 


N 


0 - ol- 


106,1.068;


132-135; MYRISTYL


1 181-195;


2 . aa . 1 


259; 


82-89,1.039; 
190- 

197,1.171; 
232-


52-57; AMIDATION 
2-5; 

PKC_PHOSPHO_SITE 
250-252; 


sp_O14933_UBC8_HUMAN 111-250;
UBIQUITIN_CONJUGAT_2 111-244;








238,1.086;


PKC_PHOSPHO_SITE 
45-47; MYRISTYL










204-
220,1.175;
177-
184,1.082;
10-16,1.056;
21-53,1.139;

17-22; MYRISTYL
65-70;

CK2_PHOSPHO_SITE
115-118;

PKC_PHOSPHO_SITE
177-179; MYRISTYL
74-79;

CK2_PHOSPHO_SITE
93-96;

PKC_PHOSPHO_SITE
93-95;

CK2_PHOSPHO_SITE
113-116; MYRISTYL
70-75; MYRISTYL
69-74;




DEX0449_03 
2 . aa . 2 


N 


0 - il- 
156; 


87-94,1.171; 
129- 

135,1.086; 

74-81,1.082; 

101- 

117, 1.175 ; 

19-28,1.066; 

33-71,1.19; 


PKC_PHOSPHO_SITE 
147-149; 

PKC_PHOSPHO_SITE 
74-76; 

CAMP_PHOSPHO_SITE
2-5; 

ASN_GLYCOSYLATION
27-30; 

CK2_PHOSPHO_SITE 
29-32; 

PKC_PHOSPHO_SITE 
5-7; 

CK2_PHOSPHO_SITE
12-15; 


UBCc 8-152;
UBIQUITIN_CONJUGAT_1 78-92;
UBIQUITIN_CONJUGAT_2 13-141;
sp_O14933_UBC8_HUMAN 13-147; UQ_con 11-147;


DEX0449_03 
2 . orf . 2


N 


0 - ol- 
168; 


31-40,1.066; 
45-83, 1.19; 
4-10, 1.1; 
99- 

106, 1.171; 
141- 

147,1.086; 

86-93,1.082; 

113- 

129,1.175; 


PKC_PHOSPHO_SITE
159-161; 

CK2_PHOSPHO_SITE 
41-44; 

ASN_GLYCOSYLATION 
39-42; MYRISTYL 
16-21; 

PKC_PHOSPHO_SITE
86-88; 


UBIQUITIN_CONJUGAT_2 20-153; UBCc 20-164; UQ_con 15-159;
sp_O14933_UBC8_HUMAN 25-159;
UBIQUITIN_CONJUGAT_1 90-104;


DEX0449_03 
2 . aa . 3 


N 


0 - Ol- 
137; 


82-98,1.175; 
68-75,1.171; 
110- 

116,1.086; 
14-52,1.19; 
55-62, 1.082; 


AMIDATION 2-5;

PKC_PHOSPHO_SITE 

55-57; 

ASN_GLYCOSYLATION
8-11; 

PKC_PHOSPHO_SITE
128-130; MYRISTYL
7-12; 

CK2_PHOSPHO_SITE 
10-13; 


UBCc 3-133;
UBIQUITIN_CONJUGAT_2 12-122;
UBIQUITIN_CONJUGAT_1 59-73;
sp_O14933_UBC8_HUMAN 8-128; UQ_con 1-128;


DEX0449_03
2 . aa . 4




0 - Ol- 
237; 


155- 

162,1.082; 
113- 

118,1.041;
168-
175,1.171;
121- 

152, 1.095 ; 
75-82,1.124;


ASN_GLYCOSYLATION
24-27; 

PKC_PHOSPHO_SITE 
65-67; MYRISTYL 
85-90; 

PKC_PHOSPHO_SITE 
155-157; 

PKC_PHOSPHO_SITE
4-6; 


sp_O14933_UBC8_HUMAN 126-228;
sp_O14933_UBC8_HUMAN 3-42; UQ_con 1-228;
UBIQUITIN_CONJUGAT_1 159-173;
UBIQUITIN_CONJUGAT_2 122-222; UBCc 5-
95-101,1.1;
30-58,1.19; 
182- 

198,1.175; 
210- 

216,1.086; 

16-25,1.066; 

5-14,1.074; 



PKC_PHOSPHO_SITE 
228-230; 

CK2_PHOSPHO_SITE 
26-29; 

PKC_PHOSPHO_SITE
118-120; 



233; 



DEX0449_03 
2 . orf . 4 



N 



0 - ol- 
156; 



32-37,1.041; 
129- 

135,1.086;
101- 

117,1.175; 
74-81,1.082; 
87-94,1.171; 
40-71,1.095; 



CK2_PHOSPHO_SITE 
16-19; 

PKC_PHOSPHO_SITE
147-149; 

PKC_PHOSPHO_SITE 
74-76; 

PKC_PHOSPHO_SITE 
37-39; MYRISTYL 
14-19; 



UBIQUITIN_CONJUGAT_2 41-141; UQ_con 14-147;
UBIQUITIN_CONJUGAT_1 78-92; UBCc 12-152;
sp_O14933_UBC8_HUMAN 45-147;



DEX0449_03 
3 . aa . 1 



N 



0 - o1-
88; 



51-58,1.097; 

4-12,1.097; 

23-30,1.121; 



PKC_PHOSPHO_SITE 
38-40; 

CK2_PHOSPHO_SITE 
48-51; 

ASN_GLYCOSYLATION
79-82; 



DEX0449_03
3 . aa . 2 



N 



0 - ol 
40; 



21-27,1.087; 
31-37,1.071; 



MYRISTYL 7-12; 
AMIDATION 10-13; 



DEX0449_03 
3 . orf . 2



0 - ol- 
60; 



21-39,1.171; 
11-17,1.065; 



MYRISTYL 17-22; 
MYRISTYL 55-60; 



DEX0449_03 
4 . aa . 1 



N 



0 - ol- 
213; 



PKC_PHOSPHO_SITE 
77-79; 

CK2_PHOSPHO_SITE
64-67; MYRISTYL 
108-113; 

PKC_PHOSPHO_SITE 
209-211; 

CK2_PHOSPHO_SITE
161-164; 

CK2_PHOSPHO_SITE 
122-125; MYRISTYL 
23-28; 

CK2_PHOSPHO_SITE
113-116; MYRISTYL 
5-10; 

CK2_PHOSPHO_SITE 
77-80; 

PKC_PHOSPHO_SITE 
64-66; MYRISTYL 
73-78; 

CK2_PHOSPHO_SITE 
209-212; 



DEX0449_03
5 . aa . 1



N 



0 - ol- 
458; 



223- 

245,1.186 
187- 

196,1.158 
175- 

180,1.057 
427- 

436,1.181 
279- 



ASN_GLYCOSYLATION
328-331; MYRISTYL 
243-248; 

PKC_PHOSPHO_SITE 
352-354; 

ASN_GLYCOSYLATION 
171-174; 

PKC_PHOSPHO_SITE 
420-422; 



ANK 352-381; ANK 219-248; ANK 388-421; ank 291-350;
ANK_REPEAT_3 388-424; ank 183-215; ank 388-424;
ANK 183-212; ANK 291-321; ANKYRIN 405-417; ANK_REPEAT_2
DEX0449_03
5 . orf . 1



N 



0 - ol 
439; 



286,1.076; 
256- 

263,1.17; 
28-61, 1.087; 
72-85,1.167; 
366- 

385,1.175;
198- 

205, 1.169; 
108- 

114,1.091;
293- 

310,1.204; 
316- 

347,1.133;
119- 

125, 1.087; 
154- 

162, 1.094; 
265- 

276,1.128; 
355- 

361, 1.101; 
392- 

412, 1.161; 
100- 

106,1.066;
274- 

291,1.204 
100- 

106, 1.087 
168- 

177,1.158 
179- 

186, 1.169 
297- 

328,1.133 
4-42, 1.154; 
246- 

257,1.128 
204- 

226, 1.186 
336- 

342,1.101 
347- 

366,1.175 
53-66,1.167; 
260- 

267,1.076; 
135- 

143,1.094; 
156- 

161, 1.057 ; 
237- 

244,1.17;
408- 

417, 1.181; 

81-87,1.066; 

373- 



CK2_PHOSPHO_SITE 
330-333; 

ASN_GLYCOSYLATION 
149-152; 
ASN_GLYCOSYLATION 
77-80; AMIDATION 
194-197; 

ASN_GLYCOSYLATION
247-250; 

PKC_PHOS PHO_S ITE 
43-45; MYRISTYL 
275-280; MYRISTYL
416-421; 

CK2_PHOSPHO_SITE 
120-123; MYRISTYL 
145-150; 
ASN_GLYCOSYLATION
136-139; 

CK2_PHOSPHO_SITE 
288-291; 

CK2_PHOSPHO_SITE 
18-21; 

PKC_PHOSPHO_SITE 
334-336; 

CK2_PHOSPHO_SITE 
404-407; 



352-384; ank 352- 
384; ANK_REPEAT_1 
219-251; 

ANK_REP_REGION 183- 
432; ank 252-290; 
ank 219-251; 
ANKYRIN 253-265; 
ANK 252-283; 



ASN_GLYCOSYLATION
309-312; 
ASN_GLYCOSYLATION 
130-133; MYRISTYL
224-229; 

ASN_GLYCOSYLATION
152-155; 

ASN_GLYCOSYLATION 
228-231; MYRISTYL 
256-261; 

CK2_PHOSPHO_SITE
101-104; 
AMIDATION 175- 
178; 

PKC_PHOSPHO_SITE 
315-317; 

PKC_PHOSPHO_SITE 
333-335; MYRISTYL 
126-131; MYRISTYL 
397-402; 

PKC_PHOSPHO_SITE 
401-403; 

ASN_GLYCOSYLATION 
58-61; 

CK2_PHOSPHO_SITE 
311-314; 

PKC_PHOSPHO_SITE 
24-26; 

CK2_PHOSPHO_SITE
385-388; 

CK2_PHOSPHO_SITE 
269-272;



ANK 272-302; ANK 
369-402; 

ANK_REPEAT_1 200-
232; ANKYRIN 234- 
246; ANKYRIN 386- 
398; ANK_REPEAT_3
369-405; ANK 233- 
264; ANK 200-229; 
ank 272-331; ANK 
164-193; 

ANK_REP_REGION 164- 
413; ank 200-232; 
ANK_REPEAT_2 333- 
365; ank 333-365; 
ank 164-196; ank 
369-405; ank 233- 
271; ANK 333-362;
1




393,1.161; 
89-95, 1.091;


ASN_GLYCOSYLATION
117-120; 




DEX0449_03 
6 . aa . 1 


Y


0 - ol-
130; 


4-21,1.165;

23-43,1.129; 

110- 

127,1.174; 

49-67,1.178; 

82- 

102,1.143; 


CK2_PHOSPHO_SITE
58-71; 

PKC_PHOSPHO_SITE 
81-83; AMIDATION 
74-77; MYRISTYL 
19-24; 

CK2_PHOSPHO_SITE
81-84; 

CAMP_PHOSPHO_SITE 
76-79; 

PKC_PHOSPHO_SITE
127-129; 




DEX0449_03 
6 . orf . 1 


N 


0 - Ol- 
243; 


157-
196,1.173; 
83-93,1.055; 
215- 

223, 1.167; 

11-37,1.163; 

139- 

149, 1.056; 
225- 

240,1.175; 
110- 

116, 1.067; 

49-61,1.064; 

119- 

131,1.061; 
201- 

208,1.12; 


MYRISTYL 135-140; 
PKC_PHOSPHO_SITE 
173-175; MYRISTYL 
19-24; MYRISTYL 
61-66; MYRISTYL 
121-126; MYRISTYL 
125-130; MYRISTYL 
46-51; 

CK2_PHOSPHO_SITE 
196-199; MYRISTYL 
151-156; MYRISTYL 
37-42; 

PKC_PHOSPHO_SITE 
206-208; MYRISTYL 
33-38; MYRISTYL 
50-55; 

CK2_PHOSPHO_SITE 
206-209; 


TGF_BETA_2 145-231;
sp_Q64280_TGF4_MOUSE 145-230; TGF-beta
137-231; TGF_BETA_1
159-174; TGFB 141-
231;


DEX0449_03 
7 . aa . 1 


N 


0 - ol- 
204; 


152- 

164,1.099; 
138- 

144,1.057; 
194- 

201,1.064; 
4-21,1.091; 

O \J O X , X a X — ' , 

125- 

131,1.074; 
63- 

113,1.187; 


CK2_PHOSPHO_SITE 
113-116; 

CK2_PHOSPHO_SITE 
28-31; MYRISTYL 
153-158; MYRISTYL 
134-139; 

CK2_PHOSPHO_SITE
109-112; 

ASN_GLYCOSYLATION 
59-62; MYRISTYL 
9-14; 

PKC_PHOSPHO_SITE
163-165; 

CK2_PHOSPHO_SITE 
141-144; 




DEX0449_03 
8 . aa . 1 


Y 


0 - ol- 
80; 


4-36,1.28; 
67-77,1.06; 


CK2_PHOSPHO_SITE 
64-67; MYRISTYL 
71-76 ; 




DEX0449_03 
8 . orf . 2 


N 


0 - ol- 
84; 


15-21,1.024; 
65-71,1.084; 
29-37,1.061; 


CK2_PHOSPHO_SITE 

59-62;

PKC_PHOSPHO_SITE 
12-14; 

PKC_PHOSPHO_SITE 
41-43; 

CK2_PHOSPHO_SITE

60-63;
DEX0449_03
8 .aa.3 



DEX0449_03
9 . aa . 1



DEX0449_04



56; 



Ol- 



4-17,1.159; 
34-53,1.146; 



CK2_PHOSPHO_SITE 
5-8; 

CAMP_PHOSPHO_SITE 
74-77; 

PKC_PHOSPHO_SITE 
22-24; 



0 - ol- 
451; 



26-42,1.143; 
70-77,1.072; 
4-11,1.207;
410- 

415,1.049; 
181- 

193,1.095; 
88- 

100,1.087; 
368- 

381,1.12; 
328- 

341,1.121; 

62-68,1.081; 

200- 

260,1.115; 
16-24,1.105; 
46-60,1.115; 
305- 

311,1.064; 
264- 

277,1.08; 
433- 

439,1.075; 
112- 

118, 1.13; 



N 



ol- 



MYRISTYL 351-356; 
MYRISTYL 365-370; 
CK2_PHOSPHO_SITE
82-85; 

CAMP_PHOSPHO_SITE
360-363; 

PKC_PHOSPHO_SITE 
389-391; 

CK2_PHOSPHO_SITE
336-339; MYRISTYL 
415-420; MYRISTYL 
106-111; 

CK2_PHOSPHO_SITE
160-163; 

PKC_PHOSPHO_SITE 
322-324; 

CK2_PHOSPHO_SITE 
125-128; 

CK2_PHOSPHO_SITE 
20-23; MYRISTYL 
254-259; 

CK2_PHOSPHO_SITE
414-417; 

CK2_PHOSPHO_SITE 
294-297; 

CK2_PHOSPHO_SITE 
96-99; 

LEUCINE_ZIPPER 
232-253; 

ASN_GLYCOSYLATION 
158-161; 

TYR_PHOSPHO_SITE
244-251; MYRISTYL 
348-353; 

CK2_PHOSPHO_SITE
166-169; 

CK2_PHOSPHO_SITE
129-132; MYRISTYL 
438-443; 

CK2_PHOSPHO_SITE 
143-146; 
AMIDATION 75-78;
MYRISTYL 407-412; 
MYRISTYL 349-354; 
ASN_GLYCOSYLATION
164-167; 

CK2_PHOSPHO_SITE
261-264;

PKC_PHOSPHO_SITE
359-361;



TPR 45-78; TPR 45-
78; TPR 247-280;
TPR_REPEAT_1 45-78;
TPR_REPEAT_2 205-
238; TPR 247-280;
TPR 205-238;
GLU_RICH 113-321;
ATP_GTP_A 89-96;
tpr 205-238;
TPR_REGION 205-280;
TPR_REPEAT_3 247-
280;



|l89- 



CK2_PHOSPHO_SITE gmd 24-331;
0 . aa . 1




107 , * 




292-295; MYRISTYL












198-203;








c 


:c.7c 1 Q46 - 

) 3 ' J f X • w , 


CK2_PHOSPHO_SITE










5 2? 1 


32-35;










JU1, 1 • UO / 


PKC_PHOSPHO_SITE










*1 7 - 

5 J. J 


277-279; MYRISTYL










iiQ 1 1 o*7 • 
>13 , 1 . / , 


143-148 ; 










fiDJ." 


PKC_PHOSPHO_SITE










JfiA 1 ft Q G • 

60U / x • yjo / 


62-64;












TYR_PHOSPHO_SITE










133 


115-123 ; 










X03,X*U0O, 


PKC_PHOSPHO_SITE










ftQ-inn 1 17 • 

DJ"1UU / X - X. / , 


10-12 ; 










-in 1 115. 


CAMP_PHOSPHO_SITE










Ol Q _ 


1fl6-Tfi9- 

IOO O J7 , 










oia 1 n*ci • 

1 . UO JL , 


CAMP_PHOSPHO_SITE
wiriiT it nu 0 xr Hw »j 1x0 












3 3 3 0 f L i X XVI M X X i-J 












u — 10 , 










172 - 


nu-n punQPtrn cttr 

It J\v_- XrXlvJOXrXlvJ Ollfi 










1 to 1 A £ T • 

1 / 0 , x . Uo x ; 


1 IO -1 7A • 

1JZ X J*± , 












pvo PTTOSPHO STTR 










x u / - 


1 R4 -1 . 

13*± 13 / , 










1 C *7 1 1 7A . 


PKC_PHOSPHO_SITE












245-247; MYRISTYL 












206-211; MYRISTYL 












33-38; 












CK2_PHOSPHO_SITE












245-248; 












PKC_PHOSPHO_SITE












185-187; MYRISTYL












36-41; 












PKC_PHOSPHO_SITE 












147-149; 












CK2_PHOSPHO_SITE












32-35; MYRISTYL










ai-jU; x • 11^ ; 












71 a - 


PlMp PWOQPWO STTE 










■JOA 1 1 Q7 . 


CC.CD . 
3-* 3 O , 










•3 *r _ c<7 1 171. 


PKC_PHOSPHO_SITE

XT XVV» f *XV^ i9 S nv 0 A u 










1 / * 


147-149; MYRISTYL

X *X / — 1 *± ^ , A*l IXvXP 1 111 










1 QA 1 1 71 • 

ly^t , 1 . 1 / x ; 


"541 - 










<6DJ 


mvp PWOQPHO GTTP1 
XXxv rfiuornu oixxa 










ZD 3 / x • uo x ; 


X X 3 X £. J , 










1U / - 


irxvv. xrxlwO xr Xlw 0 1 X la 










10 / / 1 . 1 0 flfc ; 


CO- ^4 . 




DEX0449_04




u - ox - 


1* / 


ptro PHOSPHO SITE 


ADH_SHORT 166-194;


0 . aa . 2 




Jo <£ ; 




ZOU"*OJ , 


gmd 24-332;








zyo , x . U07 ; 


nv-p PWOQPHO STTE 










13 i7 


iu - 1^ ; 










165,1.066; 


PKC_PHOSPHO_SITE 










224- 


132-134; MYRISTYL










243,1.136; 


143-148; 










65-75,1.046; 


CK2_PHOSPHO_SITE 










89-100,1.17; 


154-157; MYRISTYL


j 








270-284,1.1; 


13-18; 










196- 


PKC_PHOSPHO_SITE 










209,1.113; 


211-213; MYRISTYL


j 










233-238; 












PKC_PHOSPHO_SITE
312-314;












CAMP_PHOSPHO_SITE












221-224; MYRISTYL 












33-38; 












PKC_PHOSPHO_SITE












280-282; 












CK2_PHOSPHO_SITE 












327-330; MYRISTYL 












36-41; 












PKC_PHOSPHO_SITE 












220-222; 












PKC_PHOSPHO_SITE 












147-149; 












PKC_PHOSPHO_SITE 












132-134; 










172- 


CK2_PHOSPHO_SITE










194,1.171; 


32-35; MYRISTYL 










196- 


33-38; 










209,1.113; 


PKC_PHOSPHO_SITE










286- 


10-12; MYRISTYL 










295,1.089; 


204-209; 










159- 


PKC_PHOSPHO_SITE










165,1.066; 


280-282; 










253- 


CK2_PHOSPHO_SITE










259,1.061; 


154-157; 










314- 


PKC_PHOSPHO_SITE 










324,1.183; 


62-64; MYRISTYL 










343- 


352-357; MYRISTYL 




DEX0449_04 




0 - ol- 


352,1.075; 


13-18; 


gmd 24-361; 


0 . aa . 3 


N 


382 ; 


224- 


CK2_PHOSPHO_SITE 


ADH_SHORT 166-194; 








243,1.136; 


280-283; 










107- 


PKC_PHOSPHO_SITE 










157,1.134; 


220-222; 










36-57,1.131; 


TYR_PHOSPHO_SITE 










65-75,1.046; 


115-123; MYRISTYL 










4-11, 1.092; 


143-148; 










24-30, 1.112; 


CAMP_PHOSPHO_SITE










357- 


55-58; MYRISTYL 










374,1.131; 


241-246; 










326- 


PKC_PHOSPHO_SITE 










341,1.108; 


211-213; 










89-100,1.17; 


PKC_PHOSPHO_SITE 










270-284,1.1; 


312-314; 












CAMP_PHOSPHO_SITE












221-224; MYRISTYL 












233-238; MYRISTYL 












36-41; 










41-48,1.083; 


PKC_PHOSPHO_SITE










245- 


232-234; MYRISTYL 


i 








255,1.126; 


159-164; 


UQ_con 9-154;








147- 


AMIDATION 232-


sp_Q9BTC1_Q9BTC1_HU








154,1.063; 


235; 


MAN 28-153; UBCc 


DEX0449_04 




0 - ol- 


134-


CAMP_PHOSPHO_SITE 


14-157;


1 . aa . 1 


N 


258; 


139,1.059; 


241-244; 


UBIQUITIN_CONJUGAT_








108- 


CK2_PHOSPHO_SITE 


1 84-98; 








124, 1.184 ; 


178-181; MYRISTYL


UBIQUITIN_CONJUGAT_








92-99,1.184; 


174-179; 


2 14-146; 








6-23,1.154; 


PKC_PHOSPHO_SITE










212-


115-117;
DEX0449_04
1 . aa . 2 



N 



0 - ol- 
262; 



220,1.139; 

61-69,1.073; 

168- 

174,1.029; 
189- 

197,1.112; 
71-88,1.083; 



226- 

232,1.029; 
166- 

182,1.184; 

23-30,1.141; 

205- 

212,1.063; 
192- 

197,1.059; 
99- 

106,1.083; 

65-81,1.154; 

129- 

146,1.083; 

47-60,1.082; 

119- 

127,1.073; 

4-18,1.094; 

150- 

157,1.184; 



CK2_PHOSPHO_SITE 
173-176; MYRISTYL 
52-57; 

PKC_PHOSPHO_SITE 
162-164; 
AMIDATION 185- 
188; 

CAMP_PHOSPHO_SITE 
205-208; MYRISTYL 
247-252; 

PKC_PHOSPHO_SITE 
240-242; 

CK2_PHOSPHO_SITE 
3-6; AMIDATION 
203-206; 

PKC_PHOSPHO_SITE 
244-246; 

ASN_GLYCOSYLATION 
225-228; 



MYRISTYL 61-66; 
CK2_PHOSPHO_SITE
231-234; MYRISTYL 
248-253; 

AMIDATION 31-34; 
MYRISTYL 232-237; 
PKC_PHOSPHO_SITE 
173-175; MYRISTYL 
110-115; MYRISTYL 
247-252; 

AMIDATION 41-44; 
CK2_PHOSPHO_SITE
236-239; MYRISTYL 
217-222; MYRISTYL 
45-50; MYRISTYL 
24-29; MYRISTYL 
38-43; 

PKC_PHOSPHO_SITE 
220-222; MYRISTYL 
244-249; MYRISTYL 
254-259; 



sp_Q9BTC1_Q9BTC1_HU
MAN 86-211; UBCc 
72-215; 

UBIQUITIN_CONJUGAT 

72-204; 
UBIQUITIN_CONJUGAT_
1 142-156; UQ_con 
67-212; 



DEX0449_04 
1 . orf . 2



0 - ol 
291; 



140-157,1.083;
34-41,1.141;
177-193,1.184;
58-71,1.082;
76-92,1.154;
110-117,1.083;
216-223,1.063;
161-168,1.184;
12-29,1.094;
130-138,1.073;
237-243,1.029;
203-



MYRISTYL 255-260; 
MYRISTYL 56-61; 
CK2_PHOSPHO_SITE
247-250; 

PKC_PHOSPHO_SITE 
231-233; 

PKC_PHOSPHO_SITE 
282-284; MYRISTYL 
49-54; 

CK2_PHOSPHO_SITE 
242-245; MYRISTYL 
72-77; 

PKC_PHOSPHO_SITE 
184-186; 

AMIDATION 52-55; 
MYRISTYL 121-126; 
AMIDATION 42-45; 
MYRISTYL 35-40; 
MYRISTYL 243-248;



UBCc 83-226; UQ_con 
78-223; 
sp_Q9BTC1_Q9BTC1_HU
MAN 97-222; 
ARG_RICH 5-59; 
UBIQUITIN_CONJUGAT 

1 153-167; 

UBIQUITIN_CONJUGAT_

2 83-215;
208,1.059;


MYRISTYL 228-233; 
CAMP_PHOSPHO_SITE 
279-282;












MYRISTYL 218-223; 












PKC_PHOSPHO_SITE












116-118; 










163- 


CK2_PHOSPHO_SITE 










168,1.059; 


207-210; MYRISTYL 










6-23,1.154;


215-220 ; 










41-48,1.083;


CK2_PHOSPHO_SITE




DEX0449_04 
1 . aa . 3 


N 


0 - ol- 
233; 


197- 

203,1.029;
92-99,1.184;
140-
153,1.184; 
176- 

183, 1.063; 
108- 


202-205 ; 

PKC_PHOSPHO_SITE
144-146; 

CK2_PHOSPHO_SITE 
3-6; MYRISTYL 52- 
57; MYRISTYL 219- 
224; 

CK2_PHOSPHO_SITE 


UBIQUITIN_CONJUGAT_
2 14-175; 

UBIQUITIN_CONJUGAT_
1 84-98; 

sp_Q9BTC1_Q9BTC1_HU
MAN 28-182; UBCc 
14-186; UQ_con 9- 
183 ; 








117, 1. 18; 


133-136; 








61-69,1.073; 


PKC_PHOSPHO_SITE 










71-88,1.083; 


191-193; MYRISTYL










119- 


188-193; 










137,1.121; 


CK2_PHOSPHO_SITE 
116-119; MYRISTYL 
203-208; MYRISTYL 
225-230; 












MYRISTYL 354-359;

LI X A\. X 0 X X XJ *J -J a ~J mj J \ 












AMIDATION 80-83;












PKC_PHOSPHO_SITE












283-285;










145- 


CK2_PHOSPHO_SITE 










162, 1.154 ; 


346-349; 










279- 


CK2_PHOSPHO_SITE










292, 1.184; 


255-258; 










336- 


PKC_PHOSPHO_SITE










342, 1.029; 


255-257; MYRISTYL










180- 


128-133 ; 










187, 1.083; 
315- 


CK2_PHOSPHO_SITE 
142-145 ; MYRISTYL 


OXYTOCINR 324-337; 
UBCc 153-325; 
UBIQUITIN_CONJUGAT_ 
2 153-314; 
UBIQUITIN_CONJUGAT_ 
1 223-237; UQ_Con 
148-322; 

sp_Q9BTC1_Q9BTC1_HU
MAN 167-321; 
OXYTOCINR 93-106; 


DEX0449_04 
1 . orf . 3


N 


0 - ol- 
390; 


322,1.063;
231- 

238,1.184;
200- 

208,1.073;


342-347; 
AMIDATION 118- 
121 ; 

CAMP_PHOSPHO_SITE
90-93; MYRISTYL

-7 w ; i 1 X AX W X X XJ 








247- 

256,1.18; 


191-196; 

CAMP_PHOSPHO_SITE








15-31,1.195; 
41-51,1.11; 


378-381; 

PKC_PHOSPHO_SITE








65-90,1.129; 


381-383; 










210- 


PKC_PHOSPHO_SITE










227,1.083; 


33-35; MYRISTYL 










258- 


85-90; 










276,1.121; 


CK2_PHOSPHO_SITE










302- 


272-275; 










307,1.059; 


PKC_PHOSPHO_SITE 
45-47; 

PKC_PHOSPHO_SITE 
330-332; 

CK2_PHOSPHO_SITE
341-344; MYRISTYL
327-332; 



DEX0449_
aa. 1 



04 



N 



0 - ol- 
110; 



59-66,1.064;

23-35,1.2; 

87- 

107,1.135; 
43-51,1.055; 
4-19,1.144; 
71-80,1.11; 



PKC_PHOSPHO_SITE 
92-94; 

CAMP_PHOSPHO_SITE 
37-40; MYRISTYL 
2-7; 

PKC_PHOSPHO_SITE 
35-37; MYRISTYL 
56-61; 



DEX0449_
aa.1



04 



N 



1 - i1-95;tm96-118;o119-257;



48-55, 1.06; 
228- 

234,1.108; 
249- 

254,1.068; 
202- 

210,1.12; 
129- 

146,1.215; 

62-72,1.097; 

212- 

224, 1.14; 
157- 

169,1.121; 
185- 

191,1.092; 
94- 

118,1.175; 
76-82, 1.04; 
171- 

180,1.089; 
240- 

247,1.039; 
22-34,1.191; 



MYRISTYL 9-14; 

PKC_PHOSPHO_SITE 

214-216; 

PKC_PHOSPHO_SITE 
148-150; 

CK2_PHOSPHO_SITE 
210-213; 

PKC_PHOSPHO_SITE
184-186; 

CK2_PHOSPHO_SITE 
201-204; 

PKC_PHOSPHO_SITE
224-226; 



AhpC-TSA 67-216; 



DEX0449_ 
4 . aa . 1 



04 



N 



DEX0449_04 
4.orf .1 



N 



0 - il- 
105; 



74-81,1.122;
95- 

102,1.095; 
24-48,1.119; 



0 - Ol- 
111;



30-54,1.119; 
80-87, 1.122; 
102- 

108, 1.095; 



CAMP_PHOSPHO_SITE
4-7; AMIDATION 
16-19; 
CAMP_PHOSPHO_SITE 
63-66; 

PKC_PHOSPHO_SITE 
91-93; 

ASN_GLYCOSYLATION
21-24;
ASN_GLYCOSYLATION
56-59; 

PKC_PHOSPHO_SITE
69-71; 

ASN_GLYCOSYLATION
65-68; 

PKC_PHOSPHO_SITE 
58-60; 



sp_P02403_RL37_HUMA
N 12-62; 

Ribosomal_L37e 10-
63; RIBOSOMAL_L37E 
12-31; 



PKC_PHOSPHO_SITE 
75-77; 

ASN_GLYCOSYLATION 
71-74; 
ASN_GLYCOSYLATION
27-30; AMIDATION
22-25;



Ribosomal_L37e 18- 
69; 

sp_P02403_RL37_HUMA
N 18-68; 

RIBOSOMAL_L37E 18- 
37;
ASN_GLYCOSYLATION
62-65; 

PKC_PHOSPHO_SITE 
64-66; 

CAMP_PHOSPHO_SITE
69-72; 

PKC_PHOSPHO_SITE 
97-99; 




DEX0449_04
4.aa.2 


N 


0 - ol- 
104; 




ASN_GLYCOSYLATION
20-23; 

PKC_PHOSPHO_SITE 
2-4; 

ASN_GLYCOSYLATION
55-58; 

PKC_PHOSPHO_SITE 
57-59; AMIDATION
15-18; 

CK2_PHOSPHO_SITE
2-5; 

CAMP_PHOSPHO_SITE
62-65; 

PKC_PHOSPHO_SITE
4-6; 

PKC_PHOSPHO_SITE 
90-92; 

PKC_PHOSPHO_SITE 
68-70; 

ASN_GLYCOSYLATION 
64-67; 


Ribosomal_L37e 9-
62; 

sp_P02403_RL37_HUMA
N 10-61; 

RIBOSOMAL_L37E 11-
30; 


DEX0449_04 
4.orf .2 


N 


0 - ol- 
129; 


— 


MYRISTYL 31-36; 

PKC_PHOSPHO_SITE 

94-96; 

PKC_PHOSPHO_SITE
2-4; MYRISTYL 50- 
55; 

ASN_GLYCOSYLATION
35-38; 

PKC_PHOSPHO_SITE 
108-110; 

ASN_GLYCOSYLATION 
12-15; 

PKC_PHOSPHO_SITE

11-13; 

CK2_PHOSPHO_SITE
62-65; 

CK2_PHOSPHO_SITE
108-111; 

PKC_PHOSPHO_SITE
20-22; 

PKC_PHOSPHO_SITE
10-12; 




DEX0449_04 
4 .aa.3 


N 


0 - il- 
96; 


15-39,1.104; 
87-93,1.095; 
4-9,1.129; 
65-72,1.122; 


ASN_GLYCOSYLATION
56-59; 

CAMP_PHOSPHO_SITE
54-57; 

ASN_GLYCOSYLATION
47-50; 

PKC_PHOSPHO_SITE 
82-84;


Ribosomal_L37e 7-
54 ; 

sp_P02403_RL37_HUMA
N 12-53;
PKC_PHOSPHO_SITE
60-62; 

PKC_PHOSPHO_SITE 
49-51;




DEX0449_04 
4 . aa . 4 


N 


0 - il- 
55; 


23-36,1.118;
39-45,1.072; 


PKC_PHOSPHO_SITE 
20-22; 

CK2_PHOSPHO_SITE 
34-37; 

PKC_PHOSPHO_SITE
34-36; 

CK2_PHOSPHO_SITE
7-10; 




DEX0449_04 
4. orf .4 


N


0 - ol- 
72; 


63-69, 1.095; 
4-15,1.104; 
41-48,1.122;


PKC_PHOSPHO_SITE
58-60; 

CAMP_PHOSPHO_SITE
30-33; 

PKC_PHOSPHO_SITE
25-27; 

PKC_PHOSPHO_SITE
36-38; 

ASN_GLYCOSYLATION 
23-26; 

ASN_GLYCOSYLATION
32-35; 


Ribosomal_L37e 1- 
30; 

sp_Q9D823_Q9D823_MO
USE 1-29;


DEX0449_04 
5 . aa . 1 


N 


0 - Ol- 
185; 


7-13,1.077; 
21-27,1.045; 
65-72, 1.166; 
42-56,1.188; 
137-

162,1.156; 
169- 

175, 1.06; 
100- 

106,1.164; 
110- 

127, 1.193 ; 


ASN_GLYCOSYLATION
98-101; MYRISTYL 
9-14; MYRISTYL 
178-183; 

ASN_GLYCOSYLATION
61-64; 

CK2_PHOSPHO_SITE
106-109;

PKC_PHOSPHO_SITE
151-153; MYRISTYL 
14-19; 

PKC_PHOSPHO_SITE 
127-129; 

PKC_PHOSPHO_SITE
77-79; MYRISTYL 
19-24; 




DEX0449_04 
5 . orf . 1 


N 


0 - Ol- 
214; 


194- 

211, 1.128; 
14-21, 1.059; 
182- 

191,1.098; 
86- 

101,1.148; 
49-67,1.235; 
72-83,1.199; 
105- 

113,1.105; 
120- 

180,1.17; 
29-47,1.107;
4-11,1.148; 


MYRISTYL 71-76; 
ASN_GLYCOSYLATION
147-150; MYRISTYL 
145-150 ; 

PKC_PHOSPHO_SITE
176-178; MYRISTYL 
150-155; 

PKC_PHOSPHO_SITE 
66-68; 

PKC_PHOSPHO_SITE
34-36; 




DEX0449_04 
5 . aa . 2 




0 - Ol- 
131; 


7-13,1.077; 
21-27, 1.045; 
100- 


ASN_GLYCOSYLATION 
61-64; 

ASN_GLYCOSYLATION


RIBOSOMAL_L18E 70-
87; 

sp_Q07020_RL18_HUMA


106,1.164;
65-72,1.166;
42-56,1.188;
110-

122,1.136; 


98-101; MYRISTYL 
19-24;

CK2_PHOSPHO_SITE
106-109; 

PKC_PHOSPHO_SITE
77-79; MYRISTYL 
9-14; MYRISTYL 
14-19; 


N 49-119;

Ribosomal_L18e 31-
131; 


DEX0449_04 
5 . orf . 2


N 


0 - Ol- 

132; 


22-28,1.045; 
101- 

107,1.164; 
66-73,1.166; 
9-14,1.005; 
43-57, 1.188; 
111- 

123,1.136; 


ASN_GLYCOSYLATION
62-65; 

ASN_GLYCOSYLATION 
99-102; MYRISTYL 
15-20; 

CK2_PHOSPHO_SITE 
107-110; MYRISTYL 
20-25; MYRISTYL 
10-15; MYRISTYL 
6-11; 

PKC_PHOSPHO_SITE 
78-80; 


RIBOSOMAL_L18E 71- 
88 ; Ribosomal_L18e 
32-132; 

sp_Q07020_RL18_HUMA
N 50-120; 


DEX0449_04 
5 . aa . 3 


N 


0 - ol- 
195; 


180- 

191,1.088; 
143- 

153,1.193; 
74-80,1.087;
14-20,1.045;
58-65, 1.166; 
94- 

100,1.115; 

35-49,1.188; 

160- 

172,1.133; 
115- 

141,1.177; 


ASN_GLYCOSYLATION 
54-57; MYRISTYL 
7-12; MYRISTYL 
172-177; MYRISTYL 
127-132; MYRISTYL 
12-17; MYRISTYL 
109-114; MYRISTYL 
189-194; 

PKC_PHOSPHO_SITE
4-6; 

PKC_PHOSPHO_SITE 
70-72; 


sp_Q9D987_Q9D987_MO
USE 42-81; 
Ribosomal_L18e 24- 
160; RIBOSOMAL_L18E 
63-80; 


DEX0449_04 
6 . aa . 1 


Y 


0 - Ol- 
115; 


18-51, 1.229; 
54- 

109, 1.225; 


PKC_PHOSPHO_SITE
107-109; MYRISTYL 
34-39; MYRISTYL 
102-107; 

PKC_PHOSPHO_SITE 
38-40; MYRISTYL 
98-103; 

TYR_PHOSPHO_SITE 
40-48; 




DEX0449_04 
6 . aa . 2 


N 


0 - ol- 
156; 


91- 

126, 1.217;
37-83,1.178;
133- 

149, 1.145 ; 


MYRISTYL 126-131; 
MYRISTYL 6-11; 
CK2_PHOSPHO_SITE 
30-33; 

ASN_GLYCOSYLATION
28-31; MYRISTYL 
95-100; MYRISTYL 
38-43; 

PKC_PHOSPHO_SITE 
150-152; 

PKC_PHOSPHO_SITE 
9-11; MYRISTYL 
17-22; MYRISTYL 
14-19; 
PKC_PHOSPHO_SITE


r 

TIMP 13-148; NTR 
26-148;
3


127-129;




DEX0449_04 
7 . aa . 1 




0 - ol- 
226; 


^ 

141-

154,1.109;
159-

176,1.17;
180-
210,1.214;
42-57,1.155;
212- 

223,1.121; 
76- 

133,1.186; 
4-22, 1.064; 
32-40,1.093; 


MYRISTYL 26-31;

CK2_PHOSPHO_SITE
71-74; MYRISTYL

21-26; MYRISTYL
9-14;

CK2_PHOSPHO_SITE

132-135;

CK2_PHOSPHO_SITE
73-76; MYRISTYL

27-32;

CK2_PHOSPHO_SITE
67-70; MYRISTYL
34-39; MYRISTYL
44-49;


CYCLIN 83-185;
KV14CHANNEL 19-28;
ALA_RICH 2-22;
ANTIFREEZEI 13-22;
ANTIFREEZEI 1-11;
KV14CHANNEL 6-16;
cyclin 48-192;


DEX0449_04 
7 . orf . 1 


N 


0 - ol- 
253; 


103- 

125,1.18; 
141- 

154,1.056; 
4-13,1.152;
176- 

182,1.102; 
158- 

167,1.11; 

37-48,1.194; 

57-88,1.216; 

90-96,1.106; 

15-24,1.127; 

197- 

203,1.06; 


PKC_PHOSPHO_SITE
229-231; 

CK2_PHOSPHO_SITE 
167-170; 

CAMP_PHOSPHO_SITE
232-235; 

PKC_PHOSPHO_SITE
184-186; 

ASN_GLYCOSYLATION 
53-56; MYRISTYL 
143-148; MYRISTYL 
30-35; 

PKC_PHOSPHO_SITE 
180-182; 

TYR_PHOSPHO_SITE 
214-220; 

PKC_PHOSPHO_SITE
11-13; MYRISTYL 
202-207; 

PKC_PHOSPHO_SITE
246-248; 

CK2_PHOSPHO_SITE 
98-101; MYRISTYL 
238-243; 

PKC_PHOSPHO_SITE
167-169; MYRISTYL 
242-247; 

CK2_PHOSPHO_SITE 
121-124; 

PKC_PHOSPHO_SITE
55-57; MYRISTYL 
190-195; 


SER_RICH 209-249; 
CYCLIN 13-111; 


DEX0449_04 
7. orf .2 


N


0 - ol- 
346; 


57-88, 1.216; 
176- 

182,1.102; 
103- 

125, 1.18; 
15-24,1.127;
197- 

203,1.06; 4- 
13,1.152; 
37-48,1.194; 
280- 

287,1.082;


PKC_PHOSPHO_SITE 
180-182; 

TYR_PHOSPHO_SITE
276-283; 

PKC_PHOSPHO_SITE
333-335; MYRISTYL
190-195; 

PKC_PHOSPHO_SITE 
275-277; 

PKC_PHOSPHO_SITE 
295-297; MYRISTYL
242-247; MYRISTYL


HIGHMOBLTYIY 249-
268; SER_RICH 209-
302; HIGHMOBLTYIY
222-233;
TUBBYPROTEIN 171-
196; TUBBYPROTEIN
285-311;

HIGHMOBLTYIY 207-
219; ARG_RICH 207-

339; CYCLIN 13-111;



WO 2004/050900 



PCT/US2003/040131






315- 

320,1.033; 
158- 

167,1.11; 
141- 

154, 1.056; 
90-96,1.106; 



30-35;
PKC_PHOSPHO_SITE
184-186; 

CK2_PHOSPHO_SITE
98-101; MYRISTYL 
238-243; 

CK2_PHOSPHO_SITE 
121-124; 

CAMP_PHOSPHO_SITE 
232-235; 
CAMP_PHOSPHO_SITE 
313-316; 

CK2_PHOSPHO_SITE

167-170; MYRISTYL 

274-279; 

AMIDATION 333-

336; 

PKC_PHOSPHO_SITE 
229-231; 

PKC_PHOSPHO_SITE 
11-13; MYRISTYL 
143-148; 

ASN_GLYCOSYLATION
53-56; 

PKC_PHOSPHO_SITE
167-169; MYRISTYL 
202-207; 

TYR_PHOSPHO_SITE 
214-220; 

PKC_PHOSPHO_SITE
55-57; 

CK2_PHOSPHO_SITE 
252-255; 



DEX0449_04 
8 . aa . 1 



2 - i1-4;tm5-27;o28-52;tm53-75;i76-91;



63-85,1.21;
40-61,1.119; 
13-28,1.179; 
6-11,1.065; 



DEX0449_04 
8 . orf . 1



N 



0 - il 
92; 



12-52,1.184; 
59-89,1.161; 



MYRISTYL 59-64; 

PKC_PHOSPHO_SITE 

3-5; 

PKC_PHOSPHO_SITE 
47-49; 

PKC_PHOSPHO_SITE 
26-28; 



CD225 1-67; 



MYRISTYL 40-45; 
MYRISTYL 1-6; 
MYRISTYL 59-64; 
PKC_PHOSPHO_SITE
55-57; 

TYR_PHOSPHO_SITE 
42-49; 



DEX0449_04 
9 . aa . 1 



N 



0 - ol 
137; 



66-73,1.106;
27-39, 1.189; 
89-95,1.055; 
118- 

126,1.081; 
128- 

134,1.045; 
41-61, 1.185; 
75-83,1.108; 



CK2_PHOSPHO_SITE
107-110; 

PKC_PHOSPHO_SITE
29-31; 

PKC_PHOSPHO_SITE 
64-66; MYRISTYL 
39-44; 

CK2_PHOSPHO_SITE 
6-9; 

CK2_PHOSPHO_SITE 
62-65; 

PKC_PHOSPHO_SITE



Ribophorin_I 2-137;
DEX0449_04
9 . orf . 1



0 - ol 
148; 



24-26; 

PKC_PHOSPHO_SITE
128-130;

TYR_PHOSPHO_SITE
108-115;

PKC_PHOSPHO_SITE
87-89;



MYRISTYL 50-55;

PKC_PHOSPHO_SITE
98-100;

PKC_PHOSPHO_SITE
35-37;

CK2_PHOSPHO_SITE
118-121;

PKC_PHOSPHO_SITE
139-141;

CK2_PHOSPHO_SITE
22-25;

TYR_PHOSPHO_SITE
119-126;

CK2_PHOSPHO_SITE
17-20;

TYR_PHOSPHO_SITE
16-24;

CK2_PHOSPHO_SITE
73-76;

PKC_PHOSPHO_SITE
40-42;

PKC_PHOSPHO_SITE
75-77;



Ribophorin_I 3-148;



DEX0449_05 
0 . aa . 1 



0 - ol 
479; 



466-476,1.085;
230-237,1.111;
394-403,1.1;
126-146,1.185;
5-17,1.227;
151-158,1.106;
253-261,1.094;
405-448,1.182;
160-168,1.108;
300-311,1.133;
174-180,1.055;
67-76,1.118;
328-376,1.147;
37-59,1.187;
203-211,1.081;
112-124,1.189;
280-



PKC_PHOSPHO_SITE
369-371;

CK2_PHOSPHO_SITE
470-473; MYRISTYL
124-129; MYRISTYL
298-303;

CK2_PHOSPHO_SITE
453-456; MYRISTYL
59-64;

TYR_PHOSPHO_SITE
391-398;

CK2_PHOSPHO_SITE
399-402;

PKC_PHOSPHO_SITE
63-65; AMIDATION
462-465;

PKC_PHOSPHO_SITE
276-278; MYRISTYL
22-27;

PKC_PHOSPHO_SITE
213-215;

PKC_PHOSPHO_SITE
114-116;

CK2_PHOSPHO_SITE
437-440; MYRISTYL
60-65;

PKC_PHOSPHO_SITE
149-151;

CK2_PHOSPHO_SITE



Ribophorin_I 30- 
437;
294,1.102;

26-34,1.065; 

78-89,1.153; 



192-195;

ASN_GLYCOSYLATION 
299-302; 
CAMP_PHOSPHO_SITE
450-453; 

CK2_PHOSPHO_SITE
147-150; 

PKC_PHOSPHO_SITE 
109-111; 

CK2_PHOSPHO_SITE
385-388; 

CK2_PHOSPHO_SITE 
262-265; 

PKC_PHOSPHO_SITE 
462-464; 

PKC_PHOSPHO_SITE 
279-281; MYRISTYL 
324-329; 

TYR_PHOSPHO_SITE 

193-200;

PKC_PHOSPHO_SITE 
172-174; 



DEX0449_05 
0 . aa . 2 



361; 



ol- 



160-168,1.108;
67-76,1.118;
37-59,1.187;
230-237,1.111;
280-294,1.102;
112-124,1.189;
328-354,1.108;
26-34,1.065;
5-17,1.227;
300-311,1.133;
203-211,1.081;
151-158,1.106;
126-146,1.185;
253-261,1.094;
174-180,1.055;
78-89,1.153;



MYRISTYL 124-129; 

TYR_PHOSPHO_SITE 

193-200; 

PKC_PHOSPHO_SITE 
149-151; MYRISTYL 
324-329; MYRISTYL 
22-27; 

PKC_PHOSPHO_SITE 
172-174; 

CK2_PHOSPHO_SITE 
147-150; MYRISTYL 

298-303;

PKC_PHOSPHO_SITE 
114-116; 

PKC_PHOSPHO_SITE 
276-278; 

CK2_PHOSPHO_SITE 
192-195; 

PKC_PHOSPHO_SITE 
63-65; 

PKC_PHOSPHO_SITE
213-215; 

ASN_GLYCOSYLATION

299-302;

PKC_PHOSPHO_SITE 
109-111; MYRISTYL 
60-65; 

PKC_PHOSPHO_SITE
279-281; MYRISTYL 
59-64; 

CK2_PHOSPHO_SITE 
262-265; 



Ribophorin_I 30-
361; 



DEX0449_05
0 . aa . 3 



0 - ol- 
420; 



395- 

406,1.196; 
26-34,1.065; 
37-59,1.187; 
5-17,1.227;



MYRISTYL 124-129; 
MYRISTYL 22-27; 
PKC_PHOSPHO_SITE
410-412; MYRISTYL
377-382;



Ribophorin_I 30-
412;
z.n -i G. i 1 1 ft • 1

D / - (D, l.llO, 1 


PKT* PWOQPWO CJTTT? I 
- xvv^ rnuornv ui id i 










L / ** — j 


172-174;










i p n i n^t; . 1 

LOU; 1 . UJJ ; 1 


"'TCP PWnQPWO QTTR 1 






















IIO; l.lOJ/ 1 


PTCP PHO<?PHO GTTR 1 
cr x\.v» rnuornu o a. a xii i 












1XX g 11 X XV AwX 1JJ 










417 1 041 - 

:±X/,X.V^A, I 


394-399; MYRISTYL










£t J U 


CQ.gA . 
3 ^ O *x , 










o*J7 1 111 • 


PKC_PHOSPHO_SITE












114-116 ; 










1j O f X • XU O f 


PHOSPHO SITE 










i n n - 












111 1 1 77 . 


TYR_PHOSPHO_SITE

A X XV C IIUO AT XiW OA J.D 










*> ni — 


i Q*a _9 nn • 










Oil 1 flQI . 


PTCf 1 PHOCJpITO «3TTS 

XT XV v^ At XI WO At XI vy OX A A 










1 <^fl- 
lOU" 


A*± -7 Aj X , 










icq i inti- 
loo, J. . iu o ; 


nOlN v?AJ x v^vV O X Xxrl X X WAM 










*7Q QQ 1 1 C7 . 


OQP-7nO • MYT3TQTYT, 










328- 


60-65; 










376,1.147; 


PKC_PHOSPHO_SITE 










253- 


276-278; 










261,1.094; 


CK2_PHOSPHO_SITE 










280- 


147-150; MYRISTYL 










294,1.102;


324-329;










112- 


PKC_PHOSPHO_SITE










124,1.189;


279-281; 










383- 


PKC_PHOSPHO_SITE










393,1.162; 


63-65; MYRISTYL 
298-303; 

PKC_PHOSPHO_SITE
213-215; 

PKC_PHOSPHO_SITE 
369-371; 












MYRISTYL 33-38; 












MYRISTYL 116-121; 












PKC_PHOSPHO_SITE




DEX0449_05
1 . aa . 1 


N 


0 - ol- 
128;


8-13,1.039; 
68-76, 1.059; 
49-64, 1.126; 
104- 

117,1.058; 


106-108;

PKC_PHOSPHO_SITE
24-26; MYRISTYL 
44-49; MYRISTYL 
91-96; 

CK2_PHOSPHO_SITE
95-98; MYRISTYL 
14-19; MYRISTYL 
112-117; 










170- 


CK2_PHOSPHO_SITE 


ank 124-156; ANK 








187,1.133; 


160-163; MYRISTYL 


124-153; ank 57-89; 








93- 


81-86; 


ank 91-123; ANKYRIN 








115,1.194; 


ASN_GLYCOSYLATION


137-149; ANKYRIN 








256- 


155-158; MYRISTYL 


125-137; 








263,1.083; 


136-141; 


ANK_REP_REGION 21-


DEX0449_05 


N 


0 - Ol- 


35-54,1.172; 


CK2_PHOSPHO_SITE 


179; ANK 21-50; ank 


2 . aa . 1 


315; 


58-81,1.224; 
294- 

309,1.089; 
267- 

273,1.122; 

17-30,1.123; 

199- 


245-248; MYRISTYL 
115-120; 

PKC_PHOSPHO_SITE
21-23; 

CK2_PHOSPHO_SITE 
15-18; MYRISTYL 
5-10;


21-55; ANK_REPEAT_3 
124-156; 

ANK_REPEAT_2 91-
123; ANK_REPEAT_1 
57-89; ank 157-190; 
ANK 57-86; ANK 91- 
120;
208,1.083;
33-89, 1.046;
159-
165,1.053; 
137- 

154,1.191; 
233- 

241,1.127;


ASN_GLYCOSYLATION
85-88;

PKC_PHOSPHO_SITE
88-90; MYRISTYL
196-201; 

CK2_PHOSPHO_SITE
241-244; MYRISTYL 
192-197; MYRISTYL 
158-163; MYRISTYL 
223-228; MYRISTYL 
9-14; 

PKC_PHOSPHO_SITE 
210-212; 




DEX0449_05 
3.aa.l 


N


0 - ol- 
142; 


40-48,1.074; 
64-94,1.145; 
108- 

117,1.083; 


CAMP_PHOSPHO_SITE
17-20; 

PKC_PHOSPHO_SITE

20-22;

PKC_PHOSPHO_SITE
132-134; MYRISTYL 
38-43; 

PKC_PHOSPHO_SITE 
137-139; 

PKC_PHOSPHO_SITE 
10-12; 

CAMP_PHOSPHO_SITE
135-138;

PKC_PHOSPHO_SITE

21-23; MYRISTYL
116-121; MYRISTYL 
2-7; 

CK2_PHOSPHO_SITE
93-96; 

CAMP_PHOSPHO_SITE
134-137; 

AMIDATION 15-18; 
AMIDATION 120- 
123; 




DEX0449_05 
3 . aa . 2 


N 


0 - Ol- 
72; 


14-32,1.094; 
40-46,1.064; 


CAMP_PHOSPHO_SITE
65-68; 

CK2_PHOSPHO_SITE 
38-41; 

CAMP_PHOSPHO_SITE 
64-67; MYRISTYL 
46-51; 

PKC_PHOSPHO_SITE 
62-64; 

PKC_PHOSPHO_SITE 
67-69; AMIDATION 
50-53; MYRISTYL 
2-7; 




DEX0449_05 
3 . orf .2 


N 


0 - ol- 
122; 


67- 

108,1.199; 

59-65,1.06; 

11-47,1.156; 


CK2_PHOSPHO_SITE 
1-4; AMIDATION 
115-118; 

ASN_GLYCOSYLATION
100-103; MYRISTYL 
13-18; 

PKC_PHOSPHO_SITE
59-61; MYRISTYL
27-32;
ASN_GLYCOSYLATION












107-110;












PKC_PHOSPHO_SITE 












95-97; MYRISTYL 






I 






53-58; 












MYRISTYL 15-20;












CK2_PHOSPHO_SITE




DEX0449_05
4 . aa . 1 


N 


0 - ol- 
42;


5-11,1.101;
29-39,1.142;


23-26; MYRISTYL 
11-16; MYRISTYL 
20-25; MYRISTYL 
6-11; MYRISTYL 
16-21; 












CK2_PHOSPHO_SITE












96-99; 












CK2_PHOSPHO_SITE












145-148; 










54-80,1.148; 


PKC_PHOSPHO_SITE 










187- 


145-147; 










194,1.044; 


CK2_PHOSPHO_SITE 




DEX0449_05
4 . orf . 1


N 


0 - ol- 
260; 


197- 

207,1.183; 
86-96,1.133; 
34-52,1.074; 
118- 

126,1.089; 
14-24,1.051; 


91-94; 

PKC_PHOSPHO_SITE 
135-137; 

ASN_GLYCOSYLATION
129-132; 

PKC_PHOSPHO_SITE 
161-163; 

CAMP_PHOSPHO_SITE
151-154; 

PKC_PHOSPHO_SITE 
150-152; 


GLU_RICH 211-258; 
ASP_RICH 178-260; 
NAP 12-209; 










PKC_PHOSPHO_SITE 












170-172; 












PKC_PHOSPHO_SITE 










111- 

121,1.133; 
212- 

219,1.044; 
79- 

105,1.148; 
4-33,1.166; 
39-45,1.051; 
222- 

232,1.183; 

59-77,1.074; 

143- 

151,1.089; 


160-162; 




DEX0449_05 
4 . orf . 2


N 


0 - Ol- 
285; 


PKC_PHOSPHO_SITE 
186-188; 

ASN_GLYCOSYLATION
154-157; 

CK2_PHOSPHO_SITE 
121-124; 

CK2_PHOSPHO_SITE
1-4; 

CK2_PHOSPHO_SITE 
170-173; 

CAMP_PHOSPHO_SITE
176-179; 

PKC_PHOSPHO_SITE 
175-177; 

CK2_PHOSPHO_SITE 
116-119; 


ASP_RICH 203-285;
NAP 37-234; 
GLU_RICH 236-283; 








462- 


CK2_PHOSPHO_SITE




DEX0449_05 
5 . aa . 1 




0 - Ol- 
600; 


469,1.068; 
36-47,1.086; 
• 96- 
102,1.051; 
494- 


309-312; 

CK2_PHOSPHO__SITE 
245-248; 

ASN_GLYCOSYLATIOK 
112-115; MYRISTYI 


IP 368-376; IF_tail 
422-559; 

PRENYLATION 597- 
600; filament 25- 








506,1.08; 


515-520; 








263- 


CK2 PHOSPHO SITE 
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272,1. 


055; 


59-62; 


170- 




PKC_PHOSPHO_S ITE 


176,1. 


091; 


408-410; 


248- 




ASN_GLYCOS YLAT I ON 


261,1. 


123; 


470-473; MYRISTYL 


541- 




433-438; 


547,1. 


129; 


TYR_PHOSPHO_SITB 


189- 




291-299; MYRISTYL 


198,1. 


061; 


435-440; 


427- 




PKC_PHOSPHO_SITE 


434,1. 


067; 


226-228; 


219- 




PKC_PHOSPHO__SITE 


227,1. 


108; 


65-67; 


508- 




PKC_PHOSPHO_SITE 


530,1. 


101; 


472-474; 


13-19, 


1.061; 


PKC_PHOSPHO_SITE 


295- 




498-500; 


309,1 


102; 


PKC_PHOSPHO_SITE 


83-89, 


1.053; 


148-150; 


52-58 


1.102; 


PKC_PHOSPHO_SITE 


154- 




114-116; 


166, 1 


.071; 


TYR_PHOSPHO_SITE 


69-79 


,1.095; 


255-262; 


348- 




CK2JPHOSPHOJ3ITE 


365,1 


.142; 


204-207; 


484- 




PKC PHOSPHO_SITE 


490,1 


.068; 


90-92; 


441- 




PKC_PHOSPHO_SITE 


458, 1 


.086; 


414-416; MYRISTYL 


382- 




403-408; MYRISTYL 


397, 1 


.091; 


426-431; 


122- 




PKC_PHOSPHO_SITE 


148, 1 


.132; 


389-391; 



PKC_PHOSPHO_SITE 
556-558; 

CAMP_PHOSPHO_SITE 
558-561; MYRISTYL 
506-511; 

PKC_PHOSPHO_SITE 
59-61; 

CK2_PHOSPHO_SITE 
65-68; 

PKC_PHOSPHO_SITE 
268-270; 

CK2_PHOSPHO_SITE 
455-458; 

CK2_PHOSPHO_SITE 
194-197; 

PKC_PHOSPHO_SITE 
213-215; 

CK2_PHOSPHO_S ITE 
139-142; MYRISTYL 
535-540; 

CK2_PHOSPHO_SITB 
472-475; 

CK2JPHOSPHO_S ITE 
90-93; 

PKC_PHOSPHO_S ITE 
539-541; 
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PKC_PHOSPHO_S ITE 

93-595; MYRISTYL 
437-442; 



DEX0449_055.orf.1

447-454,1.067;
368-385,1.142;
315-329,1.102;
142-168,1.132;
33-39,1.061;
116-122,1.051;
103-109,1.053;
402-417,1.091;
504-510,1.068;
461-478,1.086;
209-218,1.061;
72-78,1.102;
482-489,1.068;
89-99,1.095;
528-550,1.101;
174-186,1.071;
56-67,1.086;
190-196,1.091;
239-247,1.108;
268-281,1.123;
514-526,1.08;
283-292,1.055;
561-567,1.129;
22-27,1.014;

PKC_PHOSPHO_SITE 134-136;
PKC_PHOSPHO_SITE 409-411;
MYRISTYL 453-458;
MYRISTYL 446-451;
PKC_PHOSPHO_SITE 576-578;
PKC_PHOSPHO_SITE 246-248;
CK2_PHOSPHO_SITE 85-88;
PKC_PHOSPHO_SITE 288-290;
CK2_PHOSPHO_SITE 214-217;
MYRISTYL 555-560;
CK2_PHOSPHO_SITE 492-495;
PKC_PHOSPHO_SITE 168-170;
PKC_PHOSPHO_SITE 613-615;
PKC_PHOSPHO_SITE 428-430;
TYR_PHOSPHO_SITE 275-282;
PKC_PHOSPHO_SITE 434-436;
CK2_PHOSPHO_SITE 159-162;
PKC_PHOSPHO_SITE 518-520;
TYR_PHOSPHO_SITE 311-319;
CK2_PHOSPHO_SITE 475-478;
CAMP_PHOSPHO_SITE 578-581;
PKC_PHOSPHO_SITE 492-494;
CK2_PHOSPHO_SITE 224-227;
ASN_GLYCOSYLATION 490-493;
PKC_PHOSPHO_SITE 233-235;
ASN_GLYCOSYLATION 132-135;
CK2_PHOSPHO_SITE 265-268;
MYRISTYL 423-428;
CK2_PHOSPHO_SITE 329-332;
PKC_PHOSPHO_SITE 79-81;

PRENYLATION 617-620;
IF_tail 442-579;
IF 388-396;
filament 45-401;

DEX0449_055.aa.2

N

237;

DEX0449_055.orf.3

Ol

0 - o1-620;

CK2_PHOSPHO_SITE 79-82;
MYRISTYL 455-460;
PKC_PHOSPHO_SITE 110-112;
MYRISTYL 535-540;
AMIDATION 6-9;
PKC_PHOSPHO_SITE 559-561;
PKC_PHOSPHO_SITE 85-87;
CK2_PHOSPHO_SITE 110-113;
MYRISTYL 526-531;
MYRISTYL 457-462;

174-186,1.071;
33-39,1.061;
199-207,1.122;
218-234,1.134;
103-109,1.053;
56-67,1.086;
142-168,1.132;
89-99,1.095;
22-27,1.014;
72-78,1.102;
190-196,1.091;
116-122,1.051;

482-489,1.068;
268-281,1.123;
22-27,1.014;
561-567,1.129;
56-67,1.086;
174-186,1.071;
368-385,1.142;
528-550,1.101;
447-454,1.067;
72-78,1.102;
283-292,1.055;

CK2_PHOSPHO_SITE 79-82;
PKC_PHOSPHO_SITE 110-112;
CK2_PHOSPHO_SITE 85-88;
ASN_GLYCOSYLATION 132-135;
AMIDATION 6-9;
PKC_PHOSPHO_SITE 79-81;
PKC_PHOSPHO_SITE 85-87;
MYRISTYL 201-206;
PKC_PHOSPHO_SITE 205-207;
PKC_PHOSPHO_SITE 168-170;
CK2_PHOSPHO_SITE 110-113;
CK2_PHOSPHO_SITE 159-162;
PKC_PHOSPHO_SITE 134-136;
CK2_PHOSPHO_SITE 210-213;

filament 45-232;

CAMP_PHOSPHO_SITE 578-581;
TYR_PHOSPHO_SITE 275-282;
CK2_PHOSPHO_SITE 329-332;
AMIDATION 6-9;
CK2_PHOSPHO_SITE 475-478;
PKC_PHOSPHO_SITE 613-615;
PKC_PHOSPHO_SITE 85-87;
MYRISTYL 446-451;
CK2_PHOSPHO_SITE 110-113;
MYRISTYL 555-560;
CK2_PHOSPHO_SITE 79-82;

PRENYLATION 617-620;
IF 388-396;
filament 45-401;
IF_tail 442-579;

209-218,1.061;
89-99,1.095;
514-526,1.08;
33-39,1.061;
461-478,1.086;
504-510,1.068;
315-329,1.102;
103-109,1.053;
239-247,1.108;
402-417,1.091;
190-196,1.091;
142-168,1.132;
116-122,1.051;

CK2_PHOSPHO_SITE 85-88;
PKC_PHOSPHO_SITE 559-561;
TYR_PHOSPHO_SITE 311-319;
PKC_PHOSPHO_SITE 79-81;
PKC_PHOSPHO_SITE 168-170;
MYRISTYL 457-462;
CK2_PHOSPHO_SITE 159-162;
PKC_PHOSPHO_SITE 134-136;
MYRISTYL 455-460;
MYRISTYL 453-458;
ASN_GLYCOSYLATION 132-135;
CK2_PHOSPHO_SITE 492-495;
ASN_GLYCOSYLATION 490-493;
PKC_PHOSPHO_SITE 428-430;
CK2_PHOSPHO_SITE 214-217;
MYRISTYL 526-531;
PKC_PHOSPHO_SITE 576-578;
PKC_PHOSPHO_SITE 409-411;
PKC_PHOSPHO_SITE 288-290;
PKC_PHOSPHO_SITE 110-112;
PKC_PHOSPHO_SITE 434-436;
MYRISTYL 535-540;
PKC_PHOSPHO_SITE 233-235;
MYRISTYL 423-428;
PKC_PHOSPHO_SITE 492-494;
CK2_PHOSPHO_SITE 265-268;
PKC_PHOSPHO_SITE 518-520;
CK2_PHOSPHO_SITE 224-227;
PKC_PHOSPHO_SITE 246-248;



Example 1b: Sequence Alignment Support

Alignments between previously identified sequences and splice variant sequences
are performed to confirm unique portions of splice variant nucleic acid and amino acid
sequences. The alignments are done using the Needle program in the European Molecular
Biology Open Software Suite (EMBOSS) version 2.2.0 available at www.emboss.org
from EMBnet (http://www.embnet.org). Default settings are used unless otherwise noted.
The Needle program in EMBOSS implements the Needleman-Wunsch algorithm.
Needleman, S. B., Wunsch, C. D., J. Mol. Biol. 48:443-453 (1970).
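
The global alignment idea behind the Needleman-Wunsch algorithm can be illustrated with a minimal sketch. This is not the EMBOSS Needle implementation: it assumes a simple match/mismatch score and a linear gap penalty (the function name and the scoring values are hypothetical, chosen only for illustration), whereas Needle uses a substitution matrix and separate gap opening and extension penalties.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-2):
    """Global alignment of sequences a and b (illustrative scoring only)."""
    n, m = len(a), len(b)
    # score[i][j] holds the best score for aligning a[:i] with b[:j]
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,  # align two residues
                              score[i - 1][j] + gap,      # gap in b
                              score[i][j - 1] + gap)      # gap in a
    # Trace back from the bottom-right corner to recover one optimal alignment
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return score[n][m], "".join(reversed(out_a)), "".join(reversed(out_b))


if __name__ == "__main__":
    total, top, bottom = needleman_wunsch("ACGT", "AGT")
    print(total)
    print(top)
    print(bottom)
```

In such an alignment, a region present in a splice variant but absent from the reference sequence appears as a run of gap characters in the reference row, which is how unique portions of a variant can be located.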

It is well known to those skilled in the art that the implementation of alignment algorithms
by various programs may result in minor changes in the generated output. These changes
include but are not limited to: alignment scores (percent identity, similarity, and gap),
display of nonaligned flanking sequence regions, and number assignment to residues.
These minor changes in the output of an alignment do not alter the physical characteristics
of the sequences or the differences between the sequences, e.g. regions of homology,
insertions, or deletions.

Example 1c: RT-PCR Analysis

To detect the presence and tissue distribution of a particular splice variant, Reverse
Transcription-Polymerase Chain Reaction (RT-PCR) is performed using cDNA generated
from a panel of tissue RNAs. See, e.g., Sambrook et al., Molecular Cloning: A Laboratory
Manual, 2d ed., Cold Spring Harbor Laboratory Press (1989) and Kawasaki ES et al.,
PNAS 85(15):5698 (1988). Total RNA is extracted from a variety of tissues and first
strand cDNA is prepared with reverse transcriptase (RT). Each panel includes 23 cDNAs
from five cancer types (lung, ovary, breast, colon, and prostate) and normal samples of
testis, placenta and fetal brain. Each cancer set is composed of three cancer cDNAs from
different donors and one normal pooled sample. Using a standard enzyme kit from BD
Bioscience Clontech (Mountain View, CA), the target transcript is detected with
sequence-specific primers designed to amplify only the particular splice variant. The PCR
reaction is run on the GeneAmp PCR System 9700 (Applied Biosystems, Foster City, CA)
thermocycler under optimal conditions. One of ordinary skill can design appropriate
primers and determine optimal conditions. The amplified product is resolved on an
agarose gel to detect a band of equivalent size to the predicted RT-PCR product. A band
indicated the presence of the splice variant in a sample. The relation of the amplified
product to the splice variant was subsequently confirmed by DNA sequencing.
After subcloning, all positively screened clones are sequence verified. The DNA
sequence verification results show the splice variant contains the predicted sequence
differences in comparison with the reference sequence.

Results for RT-PCR analysis include the sequence DEX ID, Lead Name, Cancer
Tissue(s) the transcript was detected in, Normal Tissue(s) the transcript was detected in,
the predicted length of the RT-PCR product, and the Confirmed Length of the RT-PCR
product.

RT-PCR results confirm the presence of SEQ ID NO: 1-112 in biologic samples and
distinguish between related transcripts.

Example 1d: Secretion Assay

To determine if a protein encoded by a splice variant is secreted from cells, a
secretion assay is performed. A pcDNA3.1 clone containing the gene transcript which
encodes the variant protein is transfected into 293T cells using the Superfect transfection
reagent (Qiagen, Valencia CA). Transfected cells are incubated for 28 hours before the
media is collected and immediately spun down to remove any detached cells. The
adherent cells are solubilized with lysis buffer (1% NP40, 10 mM sodium phosphate
pH 7.0, and 0.15 M NaCl). The lysed cells are collected and spun down and the
supernatant extracted as cell lysate. Western immunoblot is carried out in the following
manner: 15 µl of the cell lysate and media are run on 4-12% NuPage Bis-Tris gel
(Invitrogen, Carlsbad CA), and blotted onto a PVDF membrane (Invitrogen, Carlsbad
CA). The blot is incubated with a polyclonal primary antibody which binds to the variant
protein (Imgenex, San Diego CA) and polyclonal goat anti-rabbit-peroxidase secondary
antibody (Sigma-Aldrich, St. Louis MO). The blot is developed with the ECL Plus
chemiluminescent detection reagent (Amersham Biosciences, Piscataway NJ).

Secretion assay results are indicative of SEQ ID NO: 113-259 being a diagnostic
marker and/or therapeutic target for cancer.

Example 2a: Gene Expression Analysis 

Custom Microarray Experiment - Cancer 

Custom oligonucleotide microarrays were provided by Agilent Technologies, Inc.
(Palo Alto, CA). The microarrays were fabricated by Agilent using their technology for
the in-situ synthesis of 60mer oligonucleotides (Hughes, et al. 2001, Nature Biotechnology
19:342-347). The 60mer microarray probes were designed by Agilent, from gene
sequences provided by diaDexus, using Agilent proprietary algorithms. Whenever
possible two different 60mers were designed for each gene of interest.

All microarray experiments were two-color experiments and were performed using
Agilent-recommended protocols and reagents. Briefly, each microarray was hybridized
with cRNAs synthesized from RNA (total RNA for ovarian and prostate, polyA+ RNA for
lung, breast and colon samples), isolated from cancer and normal tissues, labeled with
fluorescent dyes Cyanine3 (Cy3) or Cyanine5 (Cy5) (NEN Life Science Products, Inc.,
Boston, MA) using a linear amplification method (Agilent). In each experiment the
experimental sample was RNA isolated from cancer tissue from a single individual and
the reference sample was a pool of RNA isolated from normal tissues of the same organ as
the cancerous tissue (e.g., normal ovarian tissue in experiments with ovarian cancer
samples). Hybridizations were carried out at 60°C, overnight using Agilent in-situ
hybridization buffer. Following washing, arrays were scanned with a GenePix 4000B
Microarray Scanner (Axon Instruments, Inc., Union City, CA). The resulting images were
analyzed with GenePix Pro 3.0 Microarray Acquisition and Analysis Software (Axon).
Data normalization and expression profiling were done with Expressionist
software from GeneData Inc. (Daly City, CA/Basel, Switzerland). Gene expression
analysis was performed using only experiments that met certain quality criteria. The
quality criteria that experiments must meet are a combination of evaluations performed by
the Expressionist software and evaluations performed manually using raw and normalized
data. To evaluate raw data quality, detection limits (the mean signal for a replicated
negative control + 2 Standard Deviations (SD)) for each channel were calculated. The
detection limit is a measure of non-specific hybridization. Acceptable detection limits
were defined for each dye (<80 for Cy5 and <150 for Cy3). Arrays with poor detection
limits in one or both channels were not analyzed and the experiments were repeated. To
evaluate normalized data quality, positive control elements included in the array were
utilized. These array features should have a mean ratio of 1 (no differential expression).
If these features had a mean ratio more than 1.5-fold up or down, the experiments
were not analyzed further and were repeated. In addition to traditional scatter plots
demonstrating the distribution of signal in each experiment, the Expressionist software
also has minimum thresholding criteria that employ user defined parameters to identify
quality data. These thresholds include two distinct quality measurements: 1) minimum
area percentage, which is a measure of the integrity of each spot and 2) signal to noise
ratio, which ensures that the signal being measured is significantly above any background
(nonspecific) signal present. Only those features that met the threshold criteria were
included in the filtering and analyses carried out by Expressionist. The thresholding
settings employed require a minimum area percentage of 60% [(% pixels > background +
2SD)-(% pixels saturated)], and a minimum signal to noise ratio of 2.0 in both channels.
By these criteria, very low expressors, saturated features and spots with abnormally high
local background were not included in analysis.
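
The quality criteria above can be summarized in a short sketch. This is an illustration only, not the Expressionist software: the function names are hypothetical, and the use of the sample standard deviation is an assumption, since the text does not say which estimator was used. The numeric thresholds are the ones stated in the text.

```python
from statistics import mean, stdev

# Acceptable detection limits per dye, as stated in the text
CY5_MAX_DETECTION_LIMIT = 80
CY3_MAX_DETECTION_LIMIT = 150


def detection_limit(negative_controls):
    """Mean signal of replicated negative controls plus 2 SD.

    Assumption: sample standard deviation (statistics.stdev).
    """
    return mean(negative_controls) + 2 * stdev(negative_controls)


def array_passes_raw_qc(cy5_negatives, cy3_negatives):
    """Both channels must fall below their acceptable detection limit."""
    return (detection_limit(cy5_negatives) < CY5_MAX_DETECTION_LIMIT
            and detection_limit(cy3_negatives) < CY3_MAX_DETECTION_LIMIT)


def positive_controls_pass(ratios, max_fold=1.5):
    """Mean ratio of positive controls must stay within 1.5-fold of 1."""
    m = mean(ratios)
    return 1 / max_fold <= m <= max_fold


def feature_passes(pct_above_background, pct_saturated, snr_cy5, snr_cy3,
                   min_area_pct=60.0, min_snr=2.0):
    """Area percentage = (% pixels > background + 2SD) - (% pixels saturated)."""
    area_pct = pct_above_background - pct_saturated
    return area_pct >= min_area_pct and snr_cy5 >= min_snr and snr_cy3 >= min_snr


if __name__ == "__main__":
    print(detection_limit([10, 12, 14]))
    print(array_passes_raw_qc([10, 12, 14], [100, 120, 140]))
    print(feature_passes(75.0, 10.0, 3.0, 2.5))
```

In this sketch an array that fails either channel is rejected as a whole, while the area and signal-to-noise thresholds are applied feature by feature, mirroring the two levels of filtering described above.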

Relative expression data was collected from Expressionist based on filtering and
clustering analyses. Up-regulated genes were identified using criteria for the percentage
of experiments in which the gene is up-regulated by at least 2-fold. In general, up-regulation
in approximately 30% of samples tested was used as a cutoff for filtering.

Two microarray experiments were performed for each normal and cancer tissue
pair. The tissue specific Array Chip for each cancer tissue is a unique microarray specific
to that tissue and cancer. The Multi-Cancer Array Chip is a universal microarray that was
hybridized with samples from each of the cancers (ovarian, breast, colon, lung, and
prostate). See the description below for the experiments specific to the different cancers.

Microarray Experiments and Data Tables 
COLON CANCER CHIPS 

For colon cancer two different chip designs were evaluated with overlapping sets
of a total of 38 samples, comparing the expression patterns of colon cancer derived
polyA+ RNA to polyA+ RNA isolated from a pool of 7 normal colon tissues. For the
Colon Array Chip all 38 samples (23 Ascending colon carcinomas and 15 Rectosigmoidal
carcinomas including: 5 stage I cancers, 15 stage II cancers, 15 stage III and 2 stage IV
cancers, as well as 28 Grade 1/2 and 10 Grade 3 cancers) were analyzed. The
histopathologic grades for cancer are classified as follows: GX, cannot be assessed; G1,
well differentiated; G2, moderately differentiated; G3, poorly differentiated; and G4,
undifferentiated. AJCC Cancer Staging Handbook, 5th Edition, 1998, page 9. For the
Colon Array Chip analysis, samples were further divided into groups based on the
expression pattern of the known colon cancer associated gene Thymidylate Synthase (TS)
(13 TS up, 25 TS not up). The association of TS with advanced colorectal cancer is well
documented. Paradiso et al., Br J Cancer 82(3):560-7 (2000); Etienne et al., J Clin
Oncol 20(12):2832-43 (2002); Aschele et al., Clin Cancer Res. 6(12):4797-802 (2000).
For the Multi-Cancer Array Chip a subset of 27 of these samples (14 Ascending colon
carcinomas and 13 Rectosigmoidal carcinomas including: 3 stage I cancers, 9 stage II
cancers, 13 stage III and 2 stage IV cancers) was assessed.

The results for the statistically significant up-regulated genes on the Colon Array
Chip are shown in Tables 1 and 2. The results for the statistically significant up-regulated
genes on the Multi-Cancer Array Chip are shown in Table 3.

The first two columns of each table contain information about the sequence itself
(Seq ID, Oligo Name), the next columns show the results obtained for all ("ALL") the
colon samples, ascending colon carcinomas ("ASC"), Rectosigmoidal carcinomas ("RS"),
cancers corresponding to stages I and II ("ST1,2"), stages III and IV ("ST3,4"), grades 1
and 2 ("GR1,2"), grade 3 ("GR3"), cancers exhibiting up-regulation of the TS gene
("TSup") or those not exhibiting up-regulation of the TS gene ("NOT TSup"). '%up'
indicates the percentage of all experiments in which up-regulation of at least 2-fold was
observed (n=38 for the Colon Array Chip, n=27 for the Multi-Cancer Array Chip), '%valid
up' indicates the percentage of experiments with valid expression values in which
up-regulation of at least 2-fold was observed.
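
The two summary statistics defined above can be sketched as follows. The function name and the convention of marking an experiment without a valid expression value as `None` are assumptions made for this illustration; the text does not describe how invalid values were represented.

```python
def up_regulation_stats(ratios, fold=2.0):
    """Return (%up, %valid up) for one oligo across a set of experiments.

    ratios: one cancer/normal expression ratio per experiment, or None
            when the experiment produced no valid value (assumed encoding).
    %up       = experiments with ratio >= fold, over ALL experiments
    %valid up = experiments with ratio >= fold, over VALID experiments only
    """
    n_all = len(ratios)
    valid = [r for r in ratios if r is not None]
    n_up = sum(1 for r in valid if r >= fold)
    pct_up = 100.0 * n_up / n_all if n_all else 0.0
    pct_valid_up = 100.0 * n_up / len(valid) if valid else 0.0
    return round(pct_up, 1), round(pct_valid_up, 1)


if __name__ == "__main__":
    # Hypothetical data: 38 experiments, 25 of them valid, 2 up-regulated
    example = [2.5, 3.0] + [1.0] * 23 + [None] * 13
    print(up_regulation_stats(example))
```

Because '%valid up' uses a smaller denominator whenever some experiments lack valid values, it is always greater than or equal to '%up' for the same oligo, and the two are equal when every experiment is valid.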



Table 1.

DEX ID            Oligo         Cln       Cln       Cln       Cln       Cln       Cln       Cln       Cln       Cln       Cln
                  Name          ALL       ALL       ASC       ASC        RS        RS     ST1,2     ST1,2     ST3,4     ST3,4
                                %up %valid up       %up %valid up       %up %valid up       %up %valid up       %up %valid up
                               n=38      n=38      n=23      n=23      n=15      n=15      n=20      n=20      n=18      n=18
DEX0449_001.nt.1  10296.0      44.7      44.7      60.9      60.9      20.0      20.0      45.0      45.0      44.4      44.4
DEX0449_001.nt.1  10297.0      63.2      63.2      73.9      73.9      46.7      46.7      65.0      65.0      61.1      61.1
DEX0449_044.nt.1  33264.0       5.3       8.0       0.0       0.0      13.3      25.0       0.0       0.0      11.1      15.4
DEX0449_044.nt.1  33265.0      10.5      11.8       4.3       5.0      20.0      21.4       5.0       5.6      16.7      18.8
DEX0449_044.nt.2  33264.0       5.3       8.0       0.0       0.0      13.3      25.0       0.0       0.0      11.1      15.4
DEX0449_044.nt.2  33265.0      10.5      11.8       4.3       5.0      20.0      21.4       5.0       5.6      16.7      18.8
DEX0449_044.nt.3  33264.0       5.3       8.0       0.0       0.0      13.3      25.0       0.0       0.0      11.1      15.4
DEX0449_044.nt.3  33265.0      10.5      11.8       4.3       5.0      20.0      21.4       5.0       5.6      16.7      18.8
DEX0449_044.nt.4  33264.0       5.3       8.0       0.0       0.0      13.3      25.0       0.0       0.0      11.1      15.4
DEX0449_044.nt.4  33265.0      10.5      11.8       4.3       5.0      20.0      21.4       5.0       5.6      16.7      18.8
DEX0449_046.nt.1  33720.0      36.8      36.8      43.5      43.5      26.7      26.7      35.0      35.0      38.9      38.9
DEX0449_047.nt.1  12809.0      15.8      15.8      13.0      13.0      20.0      20.0       5.0       5.0      27.8      27.8
DEX0449_047.nt.1  12810.0      15.8      15.8      13.0      13.0      20.0      20.0       5.0       5.0      27.8      27.8
DEX0449_047.nt.2  12809.0      15.8      15.8      13.0      13.0      20.0      20.0       5.0       5.0      27.8      27.8
DEX0449_047.nt.2  12810.0      15.8      15.8      13.0      13.0      20.0      20.0       5.0       5.0      27.8      27.8
DEX0449_048.nt.1  28637.0      73.7      73.7      82.6      82.6      60.0      60.0      70.0      70.0      77.8      77.8
DEX0449_048.nt.1  28638.0      65.8      65.8      73.9      73.9      53.3      53.3      70.0      70.0      61.1      61.1
DEX0449_049.nt.1  10224.0      13.2      13.9      17.4      18.2       6.7       7.1      10.0      10.0      16.7      18.8
DEX0449_049.nt.1  10225.0       7.9       8.8       8.7       9.5       6.7       7.7       0.0       0.0      16.7      20.0
DEX0449_051.nt.1  28403.0      10.5      10.5      17.4      17.4       0.0       0.0      10.0      10.0      11.1      11.1
DEX0449_052.nt.1  16385.0      21.1      21.1      21.7      21.7      20.0      20.0      20.0      20.0      22.2      22.2
DEX0449_053.nt.1  17654.0       5.3       5.6       4.3       4.5       6.7       7.1       0.0       0.0      11.1      11.1
DEX0449_053.nt.1  17655.0      13.2      13.9      17.4      19.0       6.7       6.7      10.0      10.5      16.7      17.6
DEX0449_053.nt.2  17654.0       5.3       5.6       4.3       4.5       6.7       7.1       0.0       0.0      11.1      11.1
DEX0449_053.nt.2  17655.0      13.2      13.9      17.4      19.0       6.7       6.7      10.0      10.5      16.7      17.6
DEX0449_054.nt.1  37943.0      52.6      52.6      56.5      56.5      46.7      46.7      50.0      50.0      55.6      55.6
DEX0449_054.nt.1  37944.0      50.0      51.4      56.5      56.5      40.0      42.9      50.0      52.6      50.0      50.0
DEX0449_054.nt.2  37943.0      52.6      52.6      56.5      56.5      46.7      46.7      50.0      50.0      55.6      55.6
DEX0449_054.nt.2  37944.0      50.0      51.4      56.5      56.5      40.0      42.9      50.0      52.6      50.0      50.0
DEX0449_055.nt.1  30951.0      65.8      65.8      69.6      69.6      60.0      60.0      65.0      65.0      66.7      66.7
DEX0449_055.nt.1  30956.0      55.3      56.8      69.6      72.7      33.3      33.3      50.0      52.6      61.1      61.1
DEX0449_055.nt.1  31201.0      65.8      65.8      69.6      69.6      60.0      60.0      70.0      70.0      61.1      61.1
DEX0449_055.nt.2  30951.0      65.8      65.8      69.6      69.6      60.0      60.0      65.0      65.0      66.7      66.7
DEX0449_055.nt.2  30956.0      55.3      56.8      69.6      72.7      33.3      33.3      50.0      52.6      61.1      61.1
DEX0449_055.nt.2  31201.0      65.8      65.8      69.6      69.6      60.0      60.0      70.0      70.0      61.1      61.1
DEX0449_055.nt.3  30951.0      65.8      65.8      69.6      69.6      60.0      60.0      65.0      65.0      66.7      66.7
DEX0449_055.nt.3  30956.0      55.3      56.8      69.6      72.7      33.3      33.3      50.0      52.6      61.1      61.1
DEX0449_055.nt.3  31201.0      65.8      65.8      69.6      69.6      60.0      60.0      70.0      70.0      61.1      61.1

Table 2.

DEX ID            Oligo         Cln       Cln       Cln       Cln       Cln       Cln       Cln       Cln
                  Name        GR1,2     GR1,2       GR3       GR3     TS up     TS up NOT TS up NOT TS up
                                %up %valid up       %up %valid up       %up %valid up       %up %valid up
                               n=28      n=28      n=10      n=10      n=13      n=13      n=25      n=25
DEX0449_001.nt.1  10296.0      35.7      35.7      70.0      70.0      61.5      61.5      36.0      36.0
DEX0449_001.nt.1  10297.0      53.6      53.6      90.0      90.0      84.6      84.6      52.0      52.0
DEX0449_044.nt.1  33264.0       7.1      11.1       0.0       0.0       0.0       0.0       8.0      11.1
DEX0449_044.nt.1  33265.0      10.7      11.5      10.0      12.5       7.7      11.1      12.0      12.0
DEX0449_044.nt.2  33264.0       7.1      11.1       0.0       0.0       0.0       0.0       8.0      11.1
DEX0449_044.nt.2  33265.0      10.7      11.5      10.0      12.5       7.7      11.1      12.0      12.0
DEX0449_044.nt.3  33264.0       7.1      11.1       0.0       0.0       0.0       0.0       8.0      11.1
DEX0449_044.nt.3  33265.0      10.7      11.5      10.0      12.5       7.7      11.1      12.0      12.0
DEX0449_044.nt.4  33264.0       7.1      11.1       0.0       0.0       0.0       0.0       8.0      11.1
DEX0449_044.nt.4  33265.0      10.7      11.5      10.0      12.5       7.7      11.1      12.0      12.0
DEX0449_046.nt.1  33720.0      28.6      28.6      60.0      60.0      38.5      38.5      36.0      36.0
DEX0449_047.nt.1  12809.0      10.7      10.7      30.0      30.0      15.4      15.4      16.0      16.0
DEX0449_047.nt.1  12810.0      10.7      10.7      30.0      30.0      15.4      15.4      16.0      16.0
DEX0449_047.nt.2  12809.0      10.7      10.7      30.0      30.0      15.4      15.4      16.0      16.0
DEX0449_047.nt.2  12810.0      10.7      10.7      30.0      30.0      15.4      15.4      16.0      16.0
DEX0449_048.nt.1  28637.0      71.4      71.4      80.0      80.0      69.2      69.2      76.0      76.0
DEX0449_048.nt.1  28638.0      64.3      64.3      70.0      70.0      61.5      61.5      68.0      68.0
DEX0449_049.nt.1  10224.0       3.6       3.7      40.0      44.4      30.8      30.8       4.0       4.3
DEX0449_049.nt.1  10225.0       3.6       3.8      20.0      25.0      23.1      23.1       0.0       0.0
DEX0449_051.nt.1  28403.0       3.6       3.6      30.0      30.0      23.1      23.1       4.0       4.0
DEX0449_052.nt.1  16385.0      14.3      14.3      40.0      40.0      30.8      30.8      16.0      16.0
DEX0449_053.nt.1  17654.0       0.0       0.0      20.0      20.0      15.4      16.7       0.0       0.0
DEX0449_053.nt.1  17655.0       7.1       7.4      30.0      33.3      38.5      38.5       0.0       0.0
DEX0449_053.nt.2  17654.0       0.0       0.0      20.0      20.0      15.4      16.7       0.0       0.0
DEX0449_053.nt.2  17655.0       7.1       7.4      30.0      33.3      38.5      38.5       0.0       0.0
DEX0449_054.nt.1  37943.0      46.4      46.4      70.0      70.0      76.9      76.9      40.0      40.0
DEX0449_054.nt.1  37944.0      42.9      44.4      70.0      70.0      76.9      76.9      36.0      37.5
DEX0449_054.nt.2  37943.0      46.4      46.4      70.0      70.0      76.9      76.9      40.0      40.0
DEX0449_054.nt.2  37944.0      42.9      44.4      70.0      70.0      76.9      76.9      36.0      37.5
DEX0449_055.nt.1  30951.0      67.9      67.9      60.0      60.0     100.0     100.0      48.0      48.0
DEX0449_055.nt.1  30956.0      53.6      55.6      60.0      60.0      92.3      92.3      36.0      37.5
DEX0449_055.nt.1  31201.0      67.9      67.9      60.0      60.0     100.0     100.0      48.0      48.0
DEX0449_055.nt.2  30951.0      67.9      67.9      60.0      60.0     100.0     100.0      48.0      48.0
DEX0449_055.nt.2  30956.0      53.6      55.6      60.0      60.0      92.3      92.3      36.0      37.5
DEX0449_055.nt.2  31201.0      67.9      67.9      60.0      60.0     100.0     100.0      48.0      48.0
DEX0449_055.nt.3  30951.0      67.9      67.9      60.0      60.0     100.0     100.0      48.0      48.0
DEX0449_055.nt.3  30956.0      53.6      55.6      60.0      60.0      92.3      92.3      36.0      37.5
DEX0449_055.nt.3  31201.0      67.9      67.9      60.0      60.0     100.0     100.0      48.0      48.0



Table 3.

DEX ID            Oligo         Cln       Cln       Cln       Cln       Cln       Cln
                  Name       Multi-    Multi-    Multi-    Multi-    Multi-    Multi-
                             Cancer    Cancer    Cancer    Cancer    Cancer    Cancer
                                ALL       ALL       ASC       ASC        RS        RS
                                %up %valid up       %up %valid up       %up %valid up
                               n=27      n=27      n=14      n=14      n=13      n=13
DEX0449_044.nt.1  78951.1       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.1  78952.0       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.1  78952.1       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.1  92319.0       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.1  92319.1       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.1  92320.0       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.1  92320.1       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.2  78951.0       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.2  78951.1       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.2  78952.0       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.2  78952.1       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.2  92319.0       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.2  92319.1       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.2  92320.0       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.2  92320.1       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.3  78951.0       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.3  78951.1       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.3  78952.0       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.3  78952.1       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.3  92319.0       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.3  92319.1       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.3  92320.0       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.3  92320.1       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.4  78951.0       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.4  78951.1       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.4  78952.0       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.4  78952.1       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.4  92319.0       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.4  92319.1       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.4  92320.0       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.4  92320.1       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_045.nt.3  .1045.0      25.9      25.9      14.3      14.3      38.5      38.5

BREAST CANCER CHIPS 

For breast cancer two different chip designs were evaluated with overlapping sets
of a total of 36 samples, comparing the expression patterns of breast cancer derived
polyA+ RNA to polyA+ RNA isolated from a pool of 10 normal breast tissues. For the
Breast Array Chip, all 36 samples (9 stage I cancers, 23 stage II cancers, 4 stage III
cancers) were analyzed. These samples also represented 10 Grade 1/2 and 26 Grade 3
cancers. The histopathologic grades for cancer are classified as follows: GX, cannot be
assessed; G1, well differentiated; G2, moderately differentiated; G3, poorly differentiated;
and G4, undifferentiated. AJCC Cancer Staging Handbook, pp. 9, (5th Ed, 1998).
Samples were further grouped based on the expression patterns of the known breast cancer
associated genes Her2 and ERα (10 HER2 up, 26 HER2 not up, 20 ER up and 16 ER not
up). For the Multi-Cancer Array Chip, a subset of 20 of these samples (9 stage I
cancers, 8 stage II cancers, 3 stage III cancers) was assessed.

No results for the statistically significant up-regulated genes on the Breast Array
Chip are shown. The results for the statistically significant up-regulated genes on the
Multi-Cancer Array Chip are shown in Table 4. The first two columns of each table
contain information about the sequence itself (Seq ID, Oligo Name), the next columns
show the results obtained for all ("ALL") breast cancer samples, cancers corresponding
to stage I ("ST1"), stages II and III ("ST2,3"), grades 1 and 2 ("GR1,2"), grade 3
("GR3"), cancers exhibiting up-regulation of Her2 ("HER2up") or ERα ("ERup") or
those not exhibiting up-regulation of Her2 ("NOT HER2up") or ERα ("NOT ERup").
'%up' indicates the percentage of all experiments in which up-regulation of at least
2-fold was observed (n=36 for the Breast Array Chip, n=20 for the Multi-Cancer Array Chip),
'%valid up' indicates the percentage of experiments with valid expression values in
which up-regulation of at least 2-fold was observed.

Table 4.

DEX ID            Oligo         Mam       Mam       Mam       Mam       Mam       Mam
                  Name       Multi-    Multi-    Multi-    Multi-    Multi-    Multi-
                             Cancer    Cancer    Cancer    Cancer    Cancer    Cancer
                                ALL       ALL       ST1       ST1     ST2,3     ST2,3
                                %up %valid up       %up %valid up       %up %valid up
                               n=20      n=20       n=9       n=9      n=11      n=11
DEX0449_044.nt.1  78951.1       5.0      14.3       0.0       0.0       9.1      16.7
DEX0449_044.nt.1  78952.0       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.1  78952.1       5.0      14.3       0.0       0.0       9.1      16.7
DEX0449_044.nt.1  92319.0       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.1  92319.1       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.1  92320.0       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.1  92320.1       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.2  78951.0       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.2  78951.1       5.0      14.3       0.0       0.0       9.1      16.7
DEX0449_044.nt.2  78952.0       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.2  78952.1       5.0      14.3       0.0       0.0       9.1      16.7
DEX0449_044.nt.2  92319.0       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.2  92319.1       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.2  92320.0       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.2  92320.1       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.3  78951.0       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.3  78951.1       5.0      14.3       0.0       0.0       9.1      16.7
DEX0449_044.nt.3  78952.0       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.3  78952.1       5.0      14.3       0.0       0.0       9.1      16.7
DEX0449_044.nt.3  92319.0       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.3  92319.1       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.3  92320.0       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.3  92320.1       0.0       0.0       0.0       0.0       0.0       0.0
DEX0449_044.nt.4  78951.0       0.0       0.0       0.0       0.0       0.0       0.0


DEX0449 044. nt. 4 


78951.1 


5.0 


14.3 


0.0 


0.0 


9.1 


16.7 


DEX0449 044. nt. 4 


78952.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0449 044. nt. 4 


78952.1 


5.0 


14.3 


0.0 


0.0 


9.1 


16.7 


DEX0449 044. nt. 4 


t 92319.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0449 044. nt . 4 


92319.1 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0449 044. nt. 4 


I 92320.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0449 044. nt . 4 


I 92320. 1 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0449 045. nt . 1 


L1045.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 
LUNG CANCER CHIPS

For lung cancer two different chip designs were evaluated with overlapping sets of
a total of 29 samples, comparing the expression patterns of lung cancer derived polyA+
RNA to polyA+ RNA isolated from a pool of 12 normal lung tissues. For the Lung Array
Chip all 29 samples (15 squamous cell carcinomas and 14 adenocarcinomas including 14
stage I and 15 stage II/III cancers) were analyzed and for the Multi-Cancer Array Chip a
subset of 22 of these samples (10 squamous cell carcinomas, 12 adenocarcinomas) were
assessed.

The results for the statistically significant up-regulated genes on the Lung Array
Chip are shown in Table 5. The results for the statistically significant up-regulated genes
on the Multi-Cancer Array Chip are shown in Table 6. The first two columns of each
table contain information about the sequence itself (DEX ID, Oligo Name), the next
columns show the results obtained for all ("ALL") lung cancer samples, squamous cell
carcinomas ("SQ"), adenocarcinomas ("AD"), or cancers corresponding to stage I
("ST1"), or stages II and III ("ST2,3"). '%up' indicates the percentage of all experiments
in which up-regulation of at least 2-fold was observed (n=29 for Lung Array Chip, n=22
for Multi-Cancer Array Chip), '%valid up' indicates the percentage of experiments with
valid expression values in which up-regulation of at least 2-fold was observed.



Table 5.

DEX ID            Oligo    Lng ALL  Lng ALL    Lng SQ  Lng SQ     Lng AD  Lng AD     Lng ST1  Lng ST1    Lng ST2,3  Lng ST2,3
                  Name     %up      %valid up  %up     %valid up  %up     %valid up  %up      %valid up  %up        %valid up
                           n=29     n=29       n=15    n=15       n=14    n=14       n=14     n=14       n=15       n=15
DEX0449_045.nt.1  1044.0   13.8     13.8       13.3    13.3       14.3    14.3       14.3     14.3       13.3       13.3
DEX0449_052.nt.1  5819.0   6.9      7.1        6.7     6.7        7.1     7.7        7.1      7.7        6.7        6.7
DEX0449_052.nt.1  5820.0   6.9      6.9        6.7     6.7        7.1     7.1        7.1      7.1        6.7        6.7
DEX0449_054.nt.1  6889.0   0.0      0.0        0.0     0.0        0.0     0.0        0.0      0.0        0.0        0.0
DEX0449_054.nt.1  6890.0   6.9      7.1        6.7     7.1        7.1     7.1        14.3     15.4       0.0        0.0
DEX0449_054.nt.2  6889.0   0.0      0.0        0.0     0.0        0.0     0.0        0.0      0.0        0.0        0.0
DEX0449_054.nt.2  6890.0   6.9      7.1        6.7     7.1        7.1     7.1        14.3     15.4       0.0        0.0



Table 6.

                           Lng Multi-Cancer
DEX ID            Oligo    ALL    ALL        SQ     SQ         AD     AD
                  Name     %up    %valid up  %up    %valid up  %up    %valid up
                           n=22   n=22       n=10   n=10       n=12   n=12
DEX0449_044.nt.1  78951.1  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.1  78952.0  4.5    33.3       10.0   50.0       0.0    0.0
DEX0449_044.nt.1  78952.1  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.1  92319.0  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.1  92319.1  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.1  92320.0  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.1  92320.1  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.2  78951.0  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.2  78951.1  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.2  78952.0  4.5    33.3       10.0   50.0       0.0    0.0
DEX0449_044.nt.2  78952.1  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.2  92319.0  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.2  92319.1  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.2  92320.0  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.2  92320.1  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.3  78951.0  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.3  78951.1  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.3  78952.0  4.5    33.3       10.0   50.0       0.0    0.0
DEX0449_044.nt.3  78952.1  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.3  92319.0  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.3  92319.1  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.3  92320.0  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.3  92320.1  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.4  78951.0  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.4  78951.1  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.4  78952.0  4.5    33.3       10.0   50.0       0.0    0.0
DEX0449_044.nt.4  78952.1  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.4  92319.0  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.4  92319.1  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.4  92320.0  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.4  92320.1  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_045.nt.3  1045.0   4.5    4.5        0.0    0.0        8.3    8.3



OVARIAN CANCER CHIPS 

For ovarian cancer two different chip designs were evaluated with overlapping sets
of a total of 19 samples, comparing the expression patterns of ovarian cancer derived total
RNA to total RNA isolated from a pool of 9 normal ovarian tissues. For the Multi-Cancer
Array Chip, all 19 samples (14 invasive carcinomas, 5 low malignant potential samples)
were analyzed and for the Ovarian Array Chip, a subset of 17 of these samples (13
invasive carcinomas, 4 low malignant potential samples) were assessed.

No results for the statistically significant up-regulated genes on the Ovarian Array
Chip are shown. The results for the Multi-Cancer Array Chip are shown in Table 7. The
first two columns of each table contain information about the sequence itself (DEX ID,
Oligo Name), the next columns show the results obtained for all ("ALL") ovarian cancer
samples, invasive carcinomas ("INV") and low malignant potential ("LMP") samples.
'%up' indicates the percentage of all experiments in which up-regulation of at least 2-fold
was observed (n=19 for the Multi-Cancer Array Chip, n=17 for the Ovarian Array Chip),
'%valid up' indicates the percentage of experiments with valid expression values in which
up-regulation of at least 2-fold was observed.



Table 7.

                           Ovr Multi-Cancer
DEX ID            Oligo    ALL    ALL        INV    INV        LMP    LMP
                  Name     %up    %valid up  %up    %valid up  %up    %valid up
                           n=19   n=19       n=14   n=14       n=5    n=5
DEX0449_044.nt.1  78951.1  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.1  78952.0  5.3    12.5       7.1    16.7       0.0    0.0
DEX0449_044.nt.1  78952.1  5.3    12.5       7.1    14.3       0.0    0.0
DEX0449_044.nt.1  92319.0  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.1  92319.1  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.1  92320.0  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.1  92320.1  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.2  78951.0  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.2  78951.1  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.2  78952.0  5.3    12.5       7.1    16.7       0.0    0.0
DEX0449_044.nt.2  78952.1  5.3    12.5       7.1    14.3       0.0    0.0
DEX0449_044.nt.2  92319.0  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.2  92319.1  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.2  92320.0  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.2  92320.1  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.3  78951.0  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.3  78951.1  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.3  78952.0  5.3    12.5       7.1    16.7       0.0    0.0
DEX0449_044.nt.3  78952.1  5.3    12.5       7.1    14.3       0.0    0.0
DEX0449_044.nt.3  92319.0  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.3  92319.1  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.3  92320.0  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.3  92320.1  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.4  78951.0  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.4  78951.1  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.4  78952.0  5.3    12.5       7.1    16.7       0.0    0.0
DEX0449_044.nt.4  78952.1  5.3    12.5       7.1    14.3       0.0    0.0
DEX0449_044.nt.4  92319.0  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.4  92319.1  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.4  92320.0  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_044.nt.4  92320.1  0.0    0.0        0.0    0.0        0.0    0.0
DEX0449_045.nt.3  1045.0   0.0    0.0        0.0    0.0        0.0    0.0



PROSTATE CANCER

For prostate cancer three different chip designs were evaluated with overlapping
sets of a total of 29 samples, comparing the expression patterns of prostate cancer or
benign disease derived total RNA to total RNA isolated from a pool of 35 normal prostate
tissues. For the Prostate1 Array and Prostate2 Array Chips all 29 samples (17 prostate
cancer samples, 12 non-malignant disease samples) were analyzed. For the Multi-Cancer
Array Chip a subset of 28 of these samples (16 prostate cancer samples, 12 non-malignant
disease samples) were analyzed.

The results for the statistically significant up-regulated genes on the Prostate1
Array Chip and the Prostate2 Array Chip are shown in Table 8. The results for the
statistically significant up-regulated genes on the Multi-Cancer Array Chip are shown in
Table 9. The first two columns of each table contain information about the sequence
itself (DEX ID, Oligo Name), the next columns show the results obtained for prostate
cancer samples ("CAN") or non-malignant disease samples ("DIS"). '%up' indicates the
percentage of all experiments in which up-regulation of at least 2-fold was observed (n=29
for the Prostate1 Array Chip and the Multi-Cancer Array Chip), '%valid up' indicates the
percentage of experiments with valid expression values in which up-regulation of at least
2-fold was observed.



Table 8.

DEX ID            Oligo     Pro CAN  Pro CAN    Pro DIS  Pro DIS
                  Name      %up      %valid up  %up      %valid up
                            n=17     n=17       n=12     n=12
DEX0449_045.nt.1  27017.01  0.0      0.0        0.0      0.0
DEX0449_045.nt.1  27017.02  0.0      0.0        0.0      0.0
DEX0449_045.nt.1  28249.01  0.0      0.0        0.0      0.0
DEX0449_045.nt.1  28249.02  0.0      0.0        0.0      0.0
DEX0449_045.nt.3  27017.01  0.0      0.0        0.0      0.0
DEX0449_045.nt.3  27017.02  0.0      0.0        0.0      0.0
DEX0449_047.nt.1  31094.01  0.0      0.0        0.0      0.0
DEX0449_047.nt.1  31094.02  0.0      0.0        0.0      0.0
DEX0449_047.nt.1  31094.03  0.0      0.0        0.0      0.0
DEX0449_047.nt.1  33588.01  5.9      8.3        0.0      0.0
DEX0449_047.nt.1  33588.02  0.0      0.0        0.0      0.0
DEX0449_047.nt.2  31094.01  0.0      0.0        0.0      0.0
DEX0449_047.nt.2  31094.02  0.0      0.0        0.0      0.0
DEX0449_047.nt.2  31094.03  0.0      0.0        0.0      0.0
DEX0449_047.nt.2  33588.01  5.9      8.3        0.0      0.0
DEX0449_047.nt.2  33588.02  0.0      0.0        0.0      0.0
DEX0449_051.nt.1  35746.01  0.0      0.0        0.0      0.0
DEX0449_051.nt.1  35746.02  0.0      0.0        0.0      0.0
DEX0449_051.nt.1  35746.03  0.0      0.0        0.0      0.0
DEX0449_051.nt.1  35788.01  0.0      0.0        0.0      0.0
DEX0449_051.nt.1  35788.02  0.0      0.0        0.0      0.0
DEX0449_051.nt.1  35788.03  0.0      0.0        0.0      0.0



Table 9.

                           Pro Multi-Cancer
DEX ID            Oligo    CAN    CAN        DIS    DIS
                  Name     %up    %valid up  %up    %valid up
                           n=16   n=16       n=12   n=12
DEX0449_044.nt.1  78951.1  0.0    0.0        0.0    0.0
DEX0449_044.nt.1  78952.0  5.9    7.7        0.0    0.0
DEX0449_044.nt.1  78952.1  5.9    7.1        0.0    0.0
DEX0449_044.nt.1  92319.0  0.0    0.0        0.0    0.0
DEX0449_044.nt.1  92319.1  0.0    0.0        0.0    0.0
DEX0449_044.nt.1  92320.0  0.0    0.0        0.0    0.0
DEX0449_044.nt.1  92320.1  0.0    0.0        0.0    0.0
DEX0449_044.nt.2  78951.0  0.0    0.0        0.0    0.0
DEX0449_044.nt.2  78951.1  0.0    0.0        0.0    0.0
DEX0449_044.nt.2  78952.0  5.9    7.7        0.0    0.0
DEX0449_044.nt.2  78952.1  5.9    7.1        0.0    0.0
DEX0449_044.nt.2  92319.0  0.0    0.0        0.0    0.0
DEX0449_044.nt.2  92319.1  0.0    0.0        0.0    0.0
DEX0449_044.nt.2  92320.0  0.0    0.0        0.0    0.0
DEX0449_044.nt.2  92320.1  0.0    0.0        0.0    0.0
DEX0449_044.nt.3  78951.0  0.0    0.0        0.0    0.0
DEX0449_044.nt.3  78951.1  0.0    0.0        0.0    0.0
DEX0449_044.nt.3  78952.0  5.9    7.7        0.0    0.0
DEX0449_044.nt.3  78952.1  5.9    7.1        0.0    0.0
DEX0449_044.nt.3  92319.0  0.0    0.0        0.0    0.0
DEX0449_044.nt.3  92319.1  0.0    0.0        0.0    0.0
DEX0449_044.nt.3  92320.0  0.0    0.0        0.0    0.0
DEX0449_044.nt.3  92320.1  0.0    0.0        0.0    0.0
DEX0449_044.nt.4  78951.0  0.0    0.0        0.0    0.0
DEX0449_044.nt.4  78951.1  0.0    0.0        0.0    0.0
DEX0449_044.nt.4  78952.0  5.9    7.7        0.0    0.0
DEX0449_044.nt.4  78952.1  5.9    7.1        0.0    0.0
DEX0449_044.nt.4  92319.0  0.0    0.0        0.0    0.0
DEX0449_044.nt.4  92319.1  0.0    0.0        0.0    0.0
DEX0449_044.nt.4  92320.0  0.0    0.0        0.0    0.0
DEX0449_044.nt.4  92320.1  0.0    0.0        0.0    0.0
DEX0449_045.nt.3  1045.0   0.0    0.0        0.0    0.0



SEQ ID NO: 1-112 was up-regulated on various tissue microarrays. Accordingly,
nucleotide SEQ ID NO: 1-112 or the encoded protein SEQ ID NO: 113-259 may be used
as a cancer therapeutic and/or diagnostic target for the tissues in which expression is
shown.

The following table lists the location (Oligo Location) where the microarray oligos
(Oligo ID) map on the transcripts (DEX ID) of the present invention. Each Oligo ID may
have been printed multiple times on a single chip as replicates. The Oligo Name is an
exemplary replicate (e.g. 1000.01) for the Oligo ID (e.g. 1000), and data from other
replicates (e.g. 1000.02, 1000.03) may be reported. Additionally, the Array (Chip Name)
on which each oligo and its replicates were printed is included.
DEX NT ID         Oligo ID  Oligo Name   Chip Name           Oligo Location
DEX0449_001.nt.1  10296     10296.0      Colon array         1034-1093
DEX0449_001.nt.1  10297     10297.0      Colon array         287-346
DEX0449_044.nt.1  78952     78952.0      Multi-Cancer array  428-487
DEX0449_044.nt.1  33264     33264.0      Colon array         405-464
DEX0449_044.nt.1  92319     92319.0      Multi-Cancer array  546-605
DEX0449_044.nt.1  78951     78951.0      Multi-Cancer array  425-484
DEX0449_044.nt.1  33265     33265.0      Colon array         356-415
DEX0449_044.nt.1  92320     92320.0      Multi-Cancer array  526-585
DEX0449_044.nt.2  92319     92319.0      Multi-Cancer array  592-651
DEX0449_044.nt.2  92320     92320.0      Multi-Cancer array  572-631
DEX0449_044.nt.2  33264     33264.0      Colon array         451-510
DEX0449_044.nt.2  78952     78952.0      Multi-Cancer array  474-533
DEX0449_044.nt.2  33265     33265.0      Colon array         402-461
DEX0449_044.nt.2  78951     78951.0      Multi-Cancer array  471-530
DEX0449_044.nt.3  92320     92320.0      Multi-Cancer array  359-418
DEX0449_044.nt.3  33265     33265.0      Colon array         189-248
DEX0449_044.nt.3  78952     78952.0      Multi-Cancer array  261-320
DEX0449_044.nt.3  33264     33264.0      Colon array         238-297
DEX0449_044.nt.3  78951     78951.0      Multi-Cancer array  258-317
DEX0449_044.nt.3  92319     92319.0      Multi-Cancer array  379-438
DEX0449_044.nt.4  92320     92320.0      Multi-Cancer array  [illegible]
DEX0449_044.nt.4  78952     78952.0      Multi-Cancer array  [illegible]
DEX0449_044.nt.4  33265     33265.0      Colon array         [illegible]
DEX0449_044.nt.4  92319     92319.0      Multi-Cancer array  [illegible]
DEX0449_044.nt.4  33264     33264.0      Colon array         214-273
DEX0449_044.nt.4  78951     78951.0      Multi-Cancer array  234-293
DEX0449_045.nt.1  1044      1044.0       Lung array          [illegible]
DEX0449_045.nt.1  27017     27017.01     Prostate1 array     1043-1102
DEX0449_045.nt.1  1045      1045.0       Multi-Cancer array  165-224
DEX0449_045.nt.1  28249     28249.02     Prostate1 array     323-382
DEX0449_045.nt.2  1044      1044.0       Lung array          175-234
DEX0449_045.nt.2  1045      1045.0       Multi-Cancer array  165-224
DEX0449_045.nt.3  1044      1044.0       Lung array          160-219
DEX0449_045.nt.3  27017     27017.01     Prostate1 array     1613-1672
DEX0449_045.nt.3  1045      1045.0       Multi-Cancer array  [illegible]
DEX0449_046.nt.1  33720     33720.0      Colon array         [illegible]
DEX0449_047.nt.1  12809     12809.0      Colon array         [illegible]
DEX0449_047.nt.1  33588     33588.01     Prostate1 array     [illegible]
DEX0449_047.nt.1  12810     12810.0      Colon array         [illegible]
DEX0449_047.nt.1  31094     31094.02     Prostate2 array     [illegible]
DEX0449_047.nt.2  12810     12810.0      Colon array         [illegible]
DEX0449_047.nt.2  33588     [illegible]  Prostate1 array     [illegible]
DEX0449_047.nt.2  31094     31094.02     Prostate2 array     [illegible]
DEX0449_047.nt.2  12809     12809.0      Colon array         [illegible]
DEX0449_048.nt.1  28637     [illegible]  Colon array         [illegible]
DEX0449_048.nt.1  28638     28638.0      Colon array         216-275
DEX0449_049.nt.1  10224     10224.0      Colon array         360-419
DEX0449_049.nt.1  10225     10225.0      Colon array         242-301
DEX0449_051.nt.1  35788     35788.03     Prostate2 array     1107-1166
DEX0449_051.nt.1  28403     28403.0      Colon array         1107-1166
DEX0449_051.nt.1  35746     35746.02     Prostate2 array     893-952
DEX0449_052.nt.1  5820      5820.0       Lung array          1714-1773
DEX0449_052.nt.1  16385     16385.0      Colon array         2216-2275
DEX0449_052.nt.1  5819      5819.0       Lung array          1865-1924
DEX0449_053.nt.1  17655     17655.0      Colon array         1536-1595
DEX0449_053.nt.1  17654     17654.0      Colon array         1600-1659
DEX0449_053.nt.2  17655     17655.0      Colon array         1055-1114
DEX0449_053.nt.2  17654     17654.0      Colon array         1119-1178
DEX0449_054.nt.1  37944     37944.0      Colon array         1296-1355
DEX0449_054.nt.1  6889      6889.0       Lung array          3086-3145
DEX0449_054.nt.1  6890      6890.0       Lung array          2956-3015
DEX0449_054.nt.1  37943     37943.0      Colon array         1416-1475
DEX0449_054.nt.2  6889      6889.0       Lung array          2461-2520
DEX0449_054.nt.2  37944     37944.0      Colon array         671-730
DEX0449_054.nt.2  6890      6890.0       Lung array          2331-2390
DEX0449_055.nt.1  30956     30956.0      Colon array         4398-4457
DEX0449_055.nt.1  31201     31201.0      Colon array         4333-4392
DEX0449_055.nt.1  30951     30951.0      Colon array         4334-4393
DEX0449_055.nt.2  30956     30956.0      Colon array         1809-1868
DEX0449_055.nt.2  31201     31201.0      Colon array         1744-1803
DEX0449_055.nt.3  30951     30951.0      Colon array         3899-3958
DEX0449_055.nt.3  30956     30956.0      Colon array         3963-4022
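
The relationship between an Oligo Name and its Oligo ID (the part before the period is the Oligo ID, the part after it identifies the replicate) can be sketched as follows. This is only an illustration of the naming convention stated above; the helper names are hypothetical, and the replicate suffix is kept as text so that forms such as "01" are preserved.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple


def split_oligo_name(name: str) -> Tuple[int, str]:
    """Split an Oligo Name such as '1000.01' into (Oligo ID, replicate suffix)."""
    oligo_id, _, replicate = name.strip().partition(".")
    return int(oligo_id), replicate


def group_replicates(names: Iterable[str]) -> Dict[int, List[str]]:
    """Collect the replicate suffixes reported for each Oligo ID."""
    groups: Dict[int, List[str]] = defaultdict(list)
    for name in names:
        oligo_id, replicate = split_oligo_name(name)
        groups[oligo_id].append(replicate)
    return dict(groups)


# Oligo Names of the two forms that appear in the text and tables.
names = ["1000.01", "1000.02", "1000.03", "78952.0", "78952.1"]
grouped = group_replicates(names)
print(grouped)
```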



Example 2b: Relative Quantitation of Gene Expression

Real-Time quantitative PCR with fluorescent Taqman® probes is a quantitation
detection system utilizing the 5'-3' nuclease activity of Taq DNA polymerase. The
method uses an internal fluorescent oligonucleotide probe (Taqman®) labeled with a 5'
reporter dye and a downstream, 3' quencher dye. During PCR, the 5'-3' nuclease activity
of Taq DNA polymerase releases the reporter, whose fluorescence can then be detected by
the laser detector of the Model 7700 Sequence Detection System (PE Applied Biosystems,
Foster City, CA, USA). Amplification of an endogenous control is used to standardize the
amount of sample RNA added to the reaction and normalize for Reverse Transcriptase
(RT) efficiency. Either cyclophilin, glyceraldehyde-3-phosphate dehydrogenase
(GAPDH), ATPase, or 18S ribosomal RNA (rRNA) is used as this endogenous control.
To calculate relative quantitation between all the samples studied, the target RNA levels
for one sample were used as the basis for comparative results (calibrator). Quantitation
relative to the "calibrator" can be obtained using the comparative method (User Bulletin
#2: ABI PRISM 7700 Sequence Detection System).
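
As an illustration, the comparative (threshold cycle, Ct) calculation is commonly written as 2^(-ΔΔCt), where ΔCt is the target Ct minus the endogenous-control Ct and ΔΔCt is the sample ΔCt minus the calibrator ΔCt. The sketch below assumes that form and approximately equal amplification efficiencies for target and control; the function name and the Ct values are hypothetical and are not data from these experiments.

```python
def relative_quantity(ct_target: float, ct_control: float,
                      ct_target_cal: float, ct_control_cal: float) -> float:
    """Expression of the target in a sample relative to the calibrator.

    ct_target, ct_control:         Ct of target gene and endogenous control in the sample
    ct_target_cal, ct_control_cal: the same two Ct values in the calibrator sample
    """
    delta_ct_sample = ct_target - ct_control              # normalize sample to control
    delta_ct_calibrator = ct_target_cal - ct_control_cal  # normalize calibrator to control
    delta_delta_ct = delta_ct_sample - delta_ct_calibrator
    return 2.0 ** (-delta_delta_ct)


# Hypothetical Ct values: the target crosses threshold 3 cycles earlier in the
# sample than in the calibrator, with identical endogenous-control Ct values.
sample_rq = relative_quantity(24.0, 18.0, 27.0, 18.0)
calibrator_rq = relative_quantity(27.0, 18.0, 27.0, 18.0)
print(sample_rq, calibrator_rq)
```

By construction the calibrator always scores 1.0, so every other value reads directly as a fold difference from the calibrator.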

The tissue distribution and the level of the target gene are evaluated for every
sample in normal and cancer tissues. Total RNA is extracted from normal tissues, cancer
tissues, and from cancers and the corresponding matched adjacent tissues. Subsequently,
first strand cDNA is prepared with reverse transcriptase and the polymerase chain reaction
is done using primers and Taqman® probes specific to each target gene. The results are
analyzed using the ABI PRISM 7700 Sequence Detector. The absolute numbers are
relative levels of expression of the target gene in a particular tissue compared to the
calibrator tissue.

One of ordinary skill can design appropriate primers. The relative levels of
expression of the CSNA versus normal tissues and other cancer tissues can then be
determined. All the values are compared to the calibrator. Normal RNA samples are
commercially available pools, originated by pooling samples of a particular tissue from
different individuals.

The relative levels of expression of the CSNA in pairs of matched samples may
also be determined. A matched pair is formed by mRNA from the cancer sample for a
particular tissue and mRNA from the normal adjacent sample for that same tissue from the
same individual. All the values are compared to the calibrator.

In the analysis of matching samples, the CSNAs show a high degree of tissue
specificity for the tissue of interest. These results confirm the tissue specificity results
obtained with normal pooled samples. Further, the level of mRNA expression in cancer
samples and the isogenic normal adjacent tissue from the same individual are compared.
This comparison provides an indication of specificity for the cancer state (e.g. higher
levels of mRNA expression in the cancer sample compared to the normal adjacent).

Information on the samples tested in the QPCR experiments includes the Sample ID
(Smpl ID), Organ, Tissue Type (Tiss Type), Diagnosis (DIAG), Disease Detail, and Stage
or Grade (STG or GRD).

Altogether, the high level of tissue specificity, plus the mRNA overexpression in
matched samples tested, are indicative of SEQ ID NO: 1-112 being a diagnostic marker
and/or a therapeutic target for cancer.

Example 3: Protein Expression

The CSNA is amplified by polymerase chain reaction (PCR) and the amplified
DNA fragment encoding the CSNA is subcloned in pET-21d for expression in E. coli. In
addition to the CSNA coding sequence, codons for two amino acids, Met-Ala, flanking the
NH2-terminus of the coding sequence of CSNA, and six histidines, flanking the
COOH-terminus of the coding sequence of CSNA, are incorporated to serve as initiating
Met/restriction site and purification tag, respectively.
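
The layout of the resulting open reading frame (Met-Ala, coding sequence, six histidines) can be sketched as below. This is a minimal illustration under stated assumptions: the particular codons chosen for Ala (GCC) and His (CAC), the TAA stop codon, the toy coding sequence, and the function names are all hypothetical and are not taken from the construct described above.

```python
from itertools import product

# Standard genetic code, built in TCAG order (T, C, A, G at each codon position).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def build_construct(cds: str) -> str:
    """Return Met-Ala + coding sequence + 6xHis + stop as a DNA string.

    cds: in-frame coding sequence without its own start or stop codon.
    """
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return "ATG" + "GCC" + cds.upper() + "CAC" * 6 + "TAA"


def translate(dna: str) -> str:
    """Translate an in-frame DNA string codon by codon ('*' marks a stop)."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


toy_cds = "AAAGAACTG"  # hypothetical 3-codon coding sequence (Lys-Glu-Leu)
construct = build_construct(toy_cds)
protein = translate(construct)
print(protein)
```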
An over-expressed protein band of the appropriate molecular weight may be
observed on a Coomassie blue stained polyacrylamide gel. This protein band is confirmed
by Western blot analysis using monoclonal antibody against 6X Histidine tag.

Large-scale purification of CSP is achieved using cell paste generated from 6-liter
bacterial cultures and immobilized metal affinity chromatography (IMAC).
Soluble fractions that are separated from total cell lysate are incubated with a nickel
chelating resin. The column is packed and washed with five column volumes of wash
buffer. CSP is eluted stepwise with buffers of various imidazole concentrations.

Example 4: Fusion Proteins

The human Fc portion of the IgG molecule can be PCR amplified, using primers
that span the 5' and 3' ends of the sequence described below. These primers also should
have convenient restriction enzyme sites that will facilitate cloning into an expression
vector, preferably a mammalian expression vector. For example, if pC4 (Accession No.
209646) is used, the human Fc portion can be ligated into the BamHI cloning site. Note
that the 3' BamHI site should be destroyed. Next, the vector containing the human Fc
portion is re-restricted with BamHI, linearizing the vector, and a polynucleotide of the
present invention, isolated by the PCR protocol described in Example 2, is ligated into this
BamHI site. Note that the polynucleotide is cloned without a stop codon, otherwise a
fusion protein will not be produced. If the naturally occurring signal sequence is used to
produce the secreted protein, pC4 does not need a second signal peptide. Alternatively, if
the naturally occurring signal sequence is not used, the vector can be modified to include a
heterologous signal sequence. See, e.g., WO 96/34891.
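
The requirement that the insert carry no stop codon can be checked before cloning. The following sketch assumes the insert is supplied as an in-frame DNA string beginning at its first codon; the function names and the example sequence are hypothetical.

```python
from typing import List

STOP_CODONS = {"TAA", "TAG", "TGA"}


def in_frame_stops(orf: str) -> List[int]:
    """Return the 0-based positions of in-frame stop codons in the insert."""
    orf = orf.upper()
    return [i for i in range(0, len(orf) - 2, 3) if orf[i:i + 3] in STOP_CODONS]


def ready_for_downstream_fusion(orf: str) -> bool:
    """True if the insert keeps the frame and reads through into the fusion partner."""
    return len(orf) % 3 == 0 and not in_frame_stops(orf)


with_stop = "ATGGCCAAAGAACTGTAA"   # hypothetical insert ending in a stop codon
without_stop = with_stop[:-3]      # same insert with the stop codon removed

print(in_frame_stops(with_stop), ready_for_downstream_fusion(with_stop))
print(in_frame_stops(without_stop), ready_for_downstream_fusion(without_stop))
```

Requiring a length that is a multiple of three is an added assumption of this sketch: it keeps the downstream Fc codons in the same reading frame as the insert.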

Example 5: Production of an Antibody from a Polypeptide

In general, such procedures involve immunizing an animal (preferably a mouse)
with polypeptide or, more preferably, with a secreted polypeptide-expressing cell. Such
cells may be cultured in any suitable tissue culture medium; however, it is preferable to
culture cells in Earle's modified Eagle's medium supplemented with 10% fetal bovine
serum (inactivated at about 56°C), and supplemented with about 10 g/l of nonessential
amino acids, about 1,000 U/ml of penicillin, and about 100 μg/ml of streptomycin. The
splenocytes of such mice are extracted and fused with a suitable myeloma cell line. Any
suitable myeloma cell line may be employed in accordance with the present invention;
however, it is preferable to employ the parent myeloma cell line (SP20), available from
the ATCC. After fusion, the resulting hybridoma cells are selectively maintained in HAT
medium, and then cloned by limiting dilution as described by Wands et al.,
Gastroenterology 80: 225-232 (1981).

The hybridoma cells obtained through such a selection are then assayed to identify
clones which secrete antibodies capable of binding the polypeptide. Alternatively,
additional antibodies capable of binding to the polypeptide can be produced in a two-step
procedure using anti-idiotypic antibodies. Such a method makes use of the fact that
antibodies are themselves antigens, and therefore, it is possible to obtain an antibody
which binds to a second antibody. In accordance with this method, protein-specific
antibodies are used to immunize an animal, preferably a mouse. The splenocytes of such
an animal are then used to produce hybridoma cells, and the hybridoma cells are screened
to identify clones which produce an antibody whose ability to bind to the protein-specific
antibody can be blocked by the polypeptide. Such antibodies comprise anti-idiotypic
antibodies to the protein-specific antibody and can be used to immunize an animal to
induce formation of further protein-specific antibodies.

Example 6: Method of Determining Alterations in a Gene Corresponding to a 
Polynucleotide 

RNA is isolated from individual patients or from a family of individuals that have
a phenotype of interest. cDNA is then generated from these RNA samples using protocols
known in the art. See Sambrook (2001), supra. The cDNA is then used as a template for
PCR, employing primers surrounding regions of interest in SEQ ID NO: 1-112.
Suggested PCR conditions consist of 35 cycles at 95°C for 30 seconds; 60-120 seconds at
52-58°C; and 60-120 seconds at 70°C, using buffer solutions described in Sidransky et al.,
Science 252(5006): 706-9 (1991). See also Sidransky et al., Science 278(5340): 1054-9
(1997).
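The suggested cycling conditions can be written down as a small data structure, which makes the range of total hold time easy to estimate. The following is a minimal sketch only: the names `Step`, `PROGRAM` and `hold_time_minutes` are hypothetical, the step labels (denature, anneal, extend) are an interpretive assumption, and ramp times between temperatures are ignored.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Step:
    """One hold step of a cycle: (min, max) temperature and (min, max) duration."""
    name: str
    temp_c: Tuple[float, float]
    seconds: Tuple[int, int]


CYCLES = 35
PROGRAM: List[Step] = [
    Step("denature", (95.0, 95.0), (30, 30)),   # 95 C for 30 seconds
    Step("anneal", (52.0, 58.0), (60, 120)),    # 52-58 C for 60-120 seconds
    Step("extend", (70.0, 70.0), (60, 120)),    # 70 C for 60-120 seconds
]


def hold_time_minutes(program: List[Step], cycles: int) -> Tuple[float, float]:
    """Return (shortest, longest) total hold time in minutes, excluding ramping."""
    shortest = cycles * sum(step.seconds[0] for step in program)
    longest = cycles * sum(step.seconds[1] for step in program)
    return shortest / 60, longest / 60


if __name__ == "__main__":
    print(hold_time_minutes(PROGRAM, CYCLES))
```

Under these assumptions the hold time depends only on the chosen annealing and extension durations; the annealing temperature within 52-58°C would be selected per primer pair.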

PCR products are then sequenced using primers labeled at their 5' end with T4
polynucleotide kinase, employing SequiTherm Polymerase (Epicentre Technologies). The
intron-exon borders of selected exons are also determined and genomic PCR products
analyzed to confirm the results. PCR products harboring suspected mutations are then
cloned and sequenced to validate the results of the direct sequencing. PCR products are
cloned into T-tailed vectors as described in Holton et al., Nucleic Acids Res., 19: 1156
(1991) and sequenced with T7 polymerase (United States Biochemical). Affected
individuals are identified by mutations not present in unaffected individuals.
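The comparison of affected and unaffected individuals amounts to a set difference over the sequence variants observed in each sample. The sketch below is illustrative only: the function name `candidate_mutations`, the sample labels, and the variant strings are hypothetical and are not taken from the sequencing protocol itself.

```python
from typing import Dict, Set


def candidate_mutations(
    affected: Dict[str, Set[str]],
    unaffected: Dict[str, Set[str]],
) -> Dict[str, Set[str]]:
    """For each affected individual, keep variants seen in no unaffected individual."""
    # Pool every variant observed in any unaffected individual.
    seen_in_unaffected: Set[str] = set().union(*unaffected.values())
    return {
        sample: variants - seen_in_unaffected
        for sample, variants in affected.items()
    }


if __name__ == "__main__":
    # Hypothetical variant calls, written as position plus base change.
    affected = {"A1": {"35G>A", "100C>T"}, "A2": {"35G>A"}}
    unaffected = {"U1": {"100C>T"}}
    print(candidate_mutations(affected, unaffected))
```

A variant shared by affected and unaffected individuals (here the hypothetical `100C>T`) is filtered out as a likely polymorphism, while a variant confined to affected individuals remains as a candidate.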

Genomic rearrangements may also be determined. Genomic clones are
nick-translated with digoxigenin deoxyuridine 5' triphosphate (Boehringer Mannheim), and
FISH is performed as described in Johnson et al., Methods Cell Biol. 35: 73-99 (1991).
Hybridization with the labeled probe is carried out using a vast excess of human cot-1
DNA for specific hybridization to the corresponding genomic locus.

Chromosomes are counterstained with 4',6-diamidino-2-phenylindole and propidium
iodide, producing a combination of C- and R-bands. Aligned images for precise mapping
are obtained using a triple-band filter set (Chroma Technology, Brattleboro, VT) in
combination with a cooled charge-coupled device camera (Photometrics, Tucson, AZ) and
variable excitation wavelength filters. Johnson (1991). Image collection, analysis and
chromosomal fractional length measurements are performed using the ISee Graphical
Program System (Inovision Corporation, Durham, NC). Chromosome alterations of the
genomic region hybridized by the probe are identified as insertions, deletions, and
translocations. These alterations are used as a diagnostic marker for an associated disease.

Example 7: Method of Detecting Abnormal Levels of a Polypeptide in a Biological 
Sample 

Antibody-sandwich ELISAs are used to detect polypeptides in a sample, preferably
a biological sample. Wells of a microtiter plate are coated with specific antibodies, at a
final concentration of 0.2 to 10 µg/ml. The antibodies are either monoclonal or polyclonal
and are produced by the method described above. The wells are blocked so that
non-specific binding of the polypeptide to the well is reduced. The coated wells are then
incubated for more than 2 hours at room temperature with a sample containing the
polypeptide. Preferably, serial dilutions of the sample should be used to validate results.
The plates are then washed three times with deionized or distilled water to remove
unbound polypeptide. Next, 50 µl of specific antibody-alkaline phosphatase conjugate, at
a concentration of 25-400 ng, is added and incubated for 2 hours at room temperature.
The plates are again washed three times with deionized or distilled water to remove
unbound conjugate. 75 µl of 4-methylumbelliferyl phosphate (MUP) or p-nitrophenyl
phosphate (NPP) substrate solution is added to each well and incubated 1 hour at room
temperature. The reaction is measured by a microtiter plate reader. A standard curve is
prepared, using serial dilutions of a control sample, and polypeptide concentrations are
plotted on the X-axis (log scale) and fluorescence or absorbance on the Y-axis (linear
scale). The concentration of the polypeptide in the sample is calculated using the standard
curve.
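Reading a sample concentration off the standard curve can be sketched as interpolation on a log-concentration axis against a linear signal axis. The following is one possible approach, not a prescribed analysis: the function name `concentration_from_signal` and the example standards are hypothetical, the signal is assumed to rise monotonically with concentration, and piecewise-linear interpolation stands in for whatever curve fit a given plate reader applies.

```python
import math
from typing import List, Tuple


def concentration_from_signal(
    standards: List[Tuple[float, float]], signal: float
) -> float:
    """Interpolate a concentration from (concentration, signal) standards.

    Concentration is handled on a log10 scale and signal on a linear scale,
    matching the axes of the standard curve. Standards must have positive
    concentrations and a monotonically increasing signal.
    """
    points = sorted((s, math.log10(c)) for c, s in standards)
    if not points[0][0] <= signal <= points[-1][0]:
        raise ValueError("signal lies outside the range of the standard curve")
    for (s0, x0), (s1, x1) in zip(points, points[1:]):
        if s0 <= signal <= s1:
            if s1 == s0:
                return 10 ** x0
            fraction = (signal - s0) / (s1 - s0)
            return 10 ** (x0 + fraction * (x1 - x0))
    raise ValueError("at least two standards are required")


if __name__ == "__main__":
    # Hypothetical standards: (concentration, plate-reader signal).
    standards = [(1.0, 0.1), (10.0, 0.5), (100.0, 0.9)]
    print(concentration_from_signal(standards, 0.3))
```

Signals outside the range spanned by the standards raise an error rather than being extrapolated, which is consistent with the use of serial sample dilutions to bring readings onto the curve.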

Example 8: Formulating a Polypeptide 

The secreted polypeptide composition will be formulated and dosed in a fashion 
consistent with good medical practice, taking into account the clinical condition of the 
individual patient (especially the side effects of treatment with the secreted polypeptide
alone), the site of delivery, the method of administration, the scheduling of administration,
and other factors known to practitioners. The "effective amount" for purposes herein is 
thus determined by such considerations. 

As a general proposition, the total pharmaceutically effective amount of secreted
polypeptide administered parenterally per dose will be in the range of about 1 µg/kg/day
to 10 mg/kg/day of patient body weight, although, as noted above, this will be subject to
therapeutic discretion. More preferably, this dose is at least 0.01 mg/kg/day, and most
preferably for humans between about 0.01 and 1 mg/kg/day for the hormone. If given
continuously, the secreted polypeptide is typically administered at a dose rate of about 1
µg/kg/hour to about 50 mg/kg/hour, either by 1-4 injections per day or by continuous
subcutaneous infusions, for example, using a mini-pump. An intravenous bag solution
may also be employed. The length of treatment needed to observe changes and the
interval following treatment for responses to occur appear to vary depending on the
desired effect.
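As a unit-conversion illustration only, the per-kilogram range stated above (read here as about 1 µg/kg/day to 10 mg/kg/day) can be scaled to a total daily amount for a given body weight. The function name `daily_dose_range_mg` and the 70 kg example weight are hypothetical; the sketch performs arithmetic and does not select a dose.

```python
from typing import Tuple

UG_PER_MG = 1000.0


def daily_dose_range_mg(
    weight_kg: float,
    low_ug_per_kg: float = 1.0,    # lower bound, in micrograms per kg per day
    high_mg_per_kg: float = 10.0,  # upper bound, in milligrams per kg per day
) -> Tuple[float, float]:
    """Return the (low, high) total daily amount in mg for a body weight in kg."""
    low_mg = weight_kg * low_ug_per_kg / UG_PER_MG
    high_mg = weight_kg * high_mg_per_kg
    return low_mg, high_mg


if __name__ == "__main__":
    # Hypothetical 70 kg body weight.
    print(daily_dose_range_mg(70.0))
```

The two bounds are kept in their original units in the signature so that the microgram-to-milligram conversion is explicit in one place.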

Pharmaceutical compositions containing the secreted protein of the invention are
administered orally, rectally, parenterally, intracisternally, intravaginally, intraperitoneally,
topically (as by powders, ointments, gels, drops or transdermal patch), buccally, or as an
oral or nasal spray. "Pharmaceutically acceptable carrier" refers to a non-toxic solid,
semisolid or liquid filler, diluent, encapsulating material or formulation auxiliary of any
type. The term "parenteral" as used herein refers to modes of administration which
include intravenous, intramuscular, intraperitoneal, intrasternal, subcutaneous and
intraarticular injection and infusion.

The secreted polypeptide is also suitably administered by sustained-release
systems. Suitable examples of sustained-release compositions include semipermeable
polymer matrices in the form of shaped articles, e.g., films, or microcapsules.
Sustained-release matrices include polylactides (U.S. Pat. No. 3,773,919, EP 58,481, the
contents of which are hereby incorporated by reference herein in their entirety),
copolymers of L-glutamic acid and gamma-ethyl-L-glutamate (Sidman, U. et al.,
Biopolymers 22: 547-556 (1983)), poly(2-hydroxyethyl methacrylate) (R. Langer et al.,
J. Biomed. Mater. Res. 15: 167-277 (1981), and R. Langer, Chem. Tech. 12: 98-105
(1982)), ethylene vinyl acetate (R. Langer et al.) or poly-D-(-)-3-hydroxybutyric acid
(EP 133,988). Sustained-release compositions also include liposomally entrapped
polypeptides. Liposomes containing the secreted polypeptide are prepared by methods
known per se: DE Epstein et al., Proc. Natl. Acad. Sci. USA 82: 3688-3692 (1985);
Hwang et al., Proc. Natl. Acad. Sci. USA 77: 4030-4034 (1980); EP 52,322; EP 36,676;
EP 88,046; EP 143,949; EP 142,641; Japanese Pat. Appl. 83-118008; U.S. Pat. Nos.
4,485,045 and 4,544,545; and EP 102,324, the contents of which are hereby incorporated
by reference herein in their entirety. Ordinarily, the liposomes are of the small (about
200-800 Angstroms) unilamellar type in which the lipid content is greater than about
30 mol. percent cholesterol, the selected proportion being adjusted for the optimal
secreted polypeptide therapy.

For parenteral administration, in one embodiment, the secreted polypeptide is
formulated generally by mixing it at the desired degree of purity, in a unit dosage
injectable form (solution, suspension, or emulsion), with a pharmaceutically acceptable
carrier, i.e., one that is non-toxic to recipients at the dosages and concentrations employed
and is compatible with other ingredients of the formulation.

For example, the formulation preferably does not include oxidizing agents and
other compounds that are known to be deleterious to polypeptides. Generally, the
formulations are prepared by contacting the polypeptide uniformly and intimately with
liquid carriers or finely divided solid carriers or both. Then, if necessary, the product is
shaped into the desired formulation. Preferably, the carrier is a parenteral carrier, more
preferably, a solution that is isotonic with the blood of the recipient. Examples of such
carrier vehicles include water, saline, Ringer's solution, and dextrose solution.
Non-aqueous vehicles such as fixed oils and ethyl oleate are also useful herein, as well as
liposomes.

The carrier suitably contains minor amounts of additives such as substances that
enhance isotonicity and chemical stability. Such materials are non-toxic to recipients at the
dosages and concentrations employed, and include buffers such as phosphate, citrate,
succinate, acetic acid, and other organic acids or their salts; antioxidants such as ascorbic
acid; low molecular weight (less than about ten residues) polypeptides, e.g., polyarginine
or tripeptides; proteins, such as serum albumin, gelatin, or immunoglobulins; hydrophilic
polymers such as polyvinylpyrrolidone; amino acids, such as glycine, glutamic acid,
aspartic acid, or arginine; monosaccharides, disaccharides, and other carbohydrates
including cellulose or its derivatives, glucose, mannose, or dextrins; chelating agents such
as EDTA; sugar alcohols such as mannitol or sorbitol; counterions such as sodium; and/or
nonionic surfactants such as polysorbates, poloxamers, or PEG.

The secreted polypeptide is typically formulated in such vehicles at a concentration 
of about 0.1 mg/ml to 100 mg/ml, preferably 1-10 mg/ml, at a pH of about 3 to 8. It will 
be understood that the use of certain of the foregoing excipients, carriers, or stabilizers
will result in the formation of polypeptide salts.

Any polypeptide to be used for therapeutic administration can be sterile. Sterility 
is readily accomplished by filtration through sterile filtration membranes (e.g., 0.2 micron 
membranes). Therapeutic polypeptide compositions generally are placed into a container 
having a sterile access port, for example, an intravenous solution bag or vial having a 
stopper pierceable by a hypodermic injection needle.

Polypeptides ordinarily will be stored in unit or multi-dose containers, for 
example, sealed ampules or vials, as an aqueous solution or as a lyophilized formulation 
for reconstitution. As an example of a lyophilized formulation, 10-ml vials are filled with 
5 ml of sterile-filtered 1% (w/v) aqueous polypeptide solution, and the resulting mixture
is lyophilized. The infusion solution is prepared by reconstituting the lyophilized
polypeptide using bacteriostatic Water-for-Injection. 
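The polypeptide content of each vial in the lyophilized example follows directly from the definition of percent weight per volume (1% w/v is 1 g per 100 ml, or 10 mg/ml). The short sketch below works through that arithmetic; the function names are hypothetical, and the 5 ml reconstitution volume in the usage example is an assumption, since no reconstitution volume is specified above.

```python
def solute_mass_mg(volume_ml: float, percent_wv: float) -> float:
    """Mass of solute in mg for a given fill volume of a % (w/v) solution."""
    # 1% (w/v) = 1 g per 100 ml = 10 mg per ml.
    return volume_ml * percent_wv * 10.0


def reconstituted_conc_mg_per_ml(mass_mg: float, diluent_ml: float) -> float:
    """Concentration after dissolving the lyophilized cake in diluent_ml of diluent."""
    return mass_mg / diluent_ml


if __name__ == "__main__":
    mass = solute_mass_mg(volume_ml=5.0, percent_wv=1.0)
    print(mass)  # polypeptide per vial, in mg
    # Assumed reconstitution volume of 5 ml (not specified in the text).
    print(reconstituted_conc_mg_per_ml(mass, diluent_ml=5.0))
```

With a 5 ml fill of a 1% (w/v) solution, each vial therefore holds 50 mg of polypeptide before lyophilization.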

The invention also provides a pharmaceutical pack or kit comprising one or more 
containers filled with one or more of the ingredients of the pharmaceutical compositions 
of the invention. Associated with such container(s) can be a notice in the form prescribed
by a governmental agency regulating the manufacture, use or sale of pharmaceuticals or
biological products, which notice reflects approval by the agency of manufacture, use or
sale for human administration. In addition, the polypeptides of the present invention may
be employed in conjunction with other therapeutic compounds.

Example 9: Method of Treating Decreased Levels of the Polypeptide

It will be appreciated that conditions caused by a decrease in the standard or 
normal expression level of a secreted protein in an individual can be treated by 
administering the polypeptide of the present invention, preferably in the secreted form.
Thus, the invention also provides a method of treatment of an individual in need of an
increased level of the polypeptide comprising administering to such an individual a 
pharmaceutical composition comprising an amount of the polypeptide to increase the 
activity level of the polypeptide in such an individual. 

For example, a patient with decreased levels of a polypeptide receives a daily dose of
0.1-100 µg/kg of the polypeptide for six consecutive days. Preferably, the polypeptide is
in the secreted form. The exact details of the dosing scheme, based on administration and 
formulation, are provided above. 

Example 10: Method of Treating Increased Levels of the Polypeptide 

Antisense or RNAi technology is used to inhibit production of a polypeptide of
the present invention. This technology is one example of a method of decreasing levels of
a polypeptide, preferably a secreted form, due to a variety of etiologies, such as cancer.

For example, a patient diagnosed with abnormally increased levels of a
polypeptide is administered intravenously antisense polynucleotides at 0.5, 1.0, 1.5, 2.0
and 3.0 mg/kg/day for 21 days. This treatment is repeated after a 7-day rest period if the
treatment was well tolerated. The formulation of the antisense polynucleotide is provided
above.

Example 11: Method of Treatment Using Gene Therapy 

One method of gene therapy transplants fibroblasts, which are capable of
expressing a polypeptide, onto a patient. Generally, fibroblasts are obtained from a
subject by skin biopsy. The resulting tissue is placed in tissue-culture medium and
separated into small pieces. Small chunks of the tissue are placed on a wet surface of a
tissue culture flask; approximately ten pieces are placed in each flask. The flask is turned
upside down, closed tight and left at room temperature overnight. After 24 hours at room
temperature, the flask is inverted and the chunks of tissue remain fixed to the bottom of
the flask and fresh media (e.g., Ham's F12 media, with 10% FBS, penicillin and
streptomycin) is added. The flasks are then incubated at 37°C for approximately one week.
At this time, fresh media is added and subsequently changed every several days.
After an additional two weeks in culture, a monolayer of fibroblasts emerges. The
monolayer is trypsinized and seeded into larger flasks. pMV-7 (Kirschmeier, P. T. et al.,
DNA, 7: 219-25 (1988)), flanked by the long terminal repeats of the Moloney murine
sarcoma virus, is digested with EcoRI and HindIII and subsequently treated with calf
intestinal phosphatase. The linear vector is fractionated on agarose gel and purified, using
glass beads.

The cDNA encoding a polypeptide of the present invention can be amplified using
PCR primers which correspond to the 5' and 3' end sequences respectively as set forth in
Example 3. Preferably, the 5' primer contains an EcoRI site and the 3' primer includes a
HindIII site. Equal quantities of the Moloney murine sarcoma virus linear backbone and
the amplified EcoRI and HindIII fragment are added together, in the presence of T4 DNA
ligase. The resulting mixture is maintained under conditions appropriate for ligation of the
two fragments. The ligation mixture is then used to transform bacteria HB101, which are
then plated onto agar containing kanamycin for the purpose of confirming that the vector
has the gene of interest properly inserted.

The amphotropic pA317 or GP+am12 packaging cells are grown in tissue culture to
confluent density in Dulbecco's Modified Eagle's Medium (DMEM) with 10% calf serum
(CS), penicillin and streptomycin. The MSV vector containing the gene is then added to
the media and the packaging cells transduced with the vector. The packaging cells now
produce infectious viral particles containing the gene (the packaging cells are now referred
to as producer cells).

Fresh media is added to the transduced producer cells, and subsequently, the media 
is harvested from a 10 cm plate of confluent producer cells. The spent media, containing
the infectious viral particles, is filtered through a Millipore filter to remove detached
producer cells and this media is then used to infect fibroblast cells. Media is removed 
from a sub-confluent plate of fibroblasts and quickly replaced with the media from the 
producer cells. This media is removed and replaced with fresh media. 

If the titer of virus is high, then virtually all fibroblasts will be infected and no
selection is required. If the titer is very low, then it is necessary to use a retroviral vector
that has a selectable marker, such as neo or his. Once the fibroblasts have been efficiently
infected, the fibroblasts are analyzed to determine whether protein is produced.

The engineered fibroblasts are then transplanted onto the host, either alone or after
having been grown to confluence on Cytodex 3 microcarrier beads.
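The remark above that a high titer infects virtually all fibroblasts, whereas a very low titer calls for a selectable marker, can be illustrated with a simple Poisson model of infection. This model is an assumption introduced here for illustration and is not part of the protocol; the function names and the example titers, volume and cell number are likewise hypothetical.

```python
import math


def multiplicity_of_infection(
    titer_per_ml: float, volume_ml: float, n_cells: float
) -> float:
    """Average number of infectious particles applied per cell."""
    return titer_per_ml * volume_ml / n_cells


def fraction_infected(moi: float) -> float:
    """Expected fraction of cells receiving at least one particle (Poisson model)."""
    return 1.0 - math.exp(-moi)


if __name__ == "__main__":
    # Hypothetical numbers: 5 ml of supernatant applied to 1e6 fibroblasts.
    for titer in (1e4, 1e6):
        moi = multiplicity_of_infection(titer, volume_ml=5.0, n_cells=1e6)
        print(titer, moi, fraction_infected(moi))
```

Under this model, the fraction of uninfected cells falls off exponentially with the number of particles per cell, which is why selection becomes unnecessary once the titer is sufficiently high.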

Example 12: Method of Treatment Using Gene Therapy-In Vivo 

Another aspect of the present invention is using in vivo gene therapy methods to 
treat disorders, diseases and conditions. The gene therapy method relates to the
introduction of naked nucleic acid (DNA, RNA, and antisense DNA or RNA) sequences
into an animal to increase or decrease the expression of the polypeptide. 

The polynucleotide of the present invention may be operatively linked to a
promoter or any other genetic elements necessary for the expression of the polypeptide by
the target tissue. Such gene therapy and delivery techniques and methods are known in
the art; see, for example, Tabata H. et al., Cardiovasc. Res. 35 (3): 470-479 (1997); Chao J.
et al., Pharmacol. Res. 35 (6): 517-522 (1997); Wolff J. A., Neuromuscul. Disord. 1 (5):
314-318 (1997); Schwartz B. et al., Gene Ther. 3 (5): 405-411 (1996); and Tsurumi Y. et
al., Circulation 94 (12): 3281-3290 (1996); WO 90/11092, WO 98/11779; U.S. Patent Nos.
5,693,622; 5,705,151; 5,580,859, the contents of which are hereby incorporated by
reference herein in their entirety.

The polynucleotide constructs may be delivered by any method that delivers 
injectable materials to the cells of an animal, such as, injection into the interstitial space of 
tissues (heart, muscle, skin, colon, liver, intestine and the like). The polynucleotide 
constructs can be delivered in a pharmaceutically acceptable liquid or aqueous carrier.

The term "naked" polynucleotide, DNA or RNA, refers to sequences that are free 
from any delivery vehicle that acts to assist, promote, or facilitate entry into the cell, 
including viral sequences, viral particles, liposome formulations, lipofectin or precipitating 
agents and the like. However, the polynucleotides of the present invention may also be 
delivered in liposome formulations (such as those taught in Felgner P. L. et al., Ann. NY
Acad. Sci. 772: 126-139 (1995) and Abdallah B. et al., Biol. Cell 85 (1): 1-7 (1995)) which
can be prepared by methods well known to those skilled in the art. 

The polynucleotide vector constructs used in the gene therapy method are
preferably constructs that will not integrate into the host genome nor will they contain
sequences that allow for replication. Any strong promoter known to those skilled in the art
can be used for driving the expression of DNA. Unlike other gene therapy techniques,
one major advantage of introducing naked nucleic acid sequences into target cells is the
transitory nature of the polynucleotide synthesis in the cells. Studies have shown that
non-replicating DNA sequences can be introduced into cells to provide production of the
desired polypeptide for periods of up to six months.

The polynucleotide construct can be delivered to the interstitial space of tissues
within an animal, including that of muscle, skin, brain, colon, liver, spleen, bone marrow,
thymus, heart, lymph, blood, bone, cartilage, pancreas, kidney, gall bladder, stomach,
intestine, testis, ovary, uterus, rectum, nervous system, eye, gland, and connective tissue.
Interstitial space of the tissues comprises the intercellular fluid, mucopolysaccharide
matrix among the reticular fibers of organ tissues, elastic fibers in the walls of vessels or
chambers, collagen fibers of fibrous tissues, or that same matrix within connective tissue
ensheathing muscle cells or in the lacunae of bone. It is similarly the space occupied by
the plasma of the circulation and the lymph fluid of the lymphatic channels. Delivery to
the interstitial space of muscle tissue is preferred for the reasons discussed below. The
constructs may be conveniently delivered by injection into the tissues comprising these
cells. They are preferably delivered to and expressed in persistent, non-dividing cells
which are differentiated, although delivery and expression may be achieved in
non-differentiated or less completely differentiated cells, such as, for example, stem cells
of blood or skin fibroblasts. In vivo muscle cells are particularly competent in their ability
to take up and express polynucleotides.

For the naked polynucleotide injection, an effective dosage amount of DNA or
RNA will be in the range of from about 0.05 µg/kg body weight to about 50 mg/kg body
weight. Preferably the dosage will be from about 0.005 mg/kg to about 20 mg/kg and
more preferably from about 0.05 mg/kg to about 5 mg/kg. Of course, as the artisan of
ordinary skill will appreciate, this dosage will vary according to the tissue site of injection.
The appropriate and effective dosage of nucleic acid sequence can readily be determined
by those of ordinary skill in the art and may depend on the condition being treated and the
route of administration. The preferred route of administration is by the parenteral route of
injection into the interstitial space of tissues. However, other parenteral routes may also
be used, such as inhalation of an aerosol formulation, particularly for delivery to colons or
bronchial tissues, throat or mucous membranes of the nose. In addition, naked
polynucleotide constructs can be delivered to arteries during angioplasty by the catheter
used in the procedure.

The dose response effects of injected polynucleotide in muscle in vivo are
determined as follows. Suitable template DNA for production of mRNA coding for
polypeptide of the present invention is prepared in accordance with a standard
recombinant DNA methodology. The template DNA, which may be either circular or
linear, is either used as naked DNA or complexed with liposomes. The quadriceps
muscles of mice are then injected with various amounts of the template DNA.

Five to six week old female and male Balb/C mice are anesthetized by 
intraperitoneal injection with 0.3 ml of 2.5% Avertin. A 1.5 cm incision is made on the 
anterior thigh, and the quadriceps muscle is directly visualized. The template DNA is 
injected in 0.1 ml of carrier in a 1 cc syringe through a 27 gauge needle over one minute,
approximately 0.5 cm from the distal insertion site of the muscle into the knee and about 
0.2 cm deep. A suture is placed over the injection site for future localization, and the skin 
is closed with stainless steel clips. 

After an appropriate incubation time (e.g., 7 days) muscle extracts are prepared by
excising the entire quadriceps. Every fifth 15 µm cross-section of the individual
quadriceps muscles is histochemically stained for protein expression. A time course for
protein expression may be done in a similar fashion except that quadriceps from different
mice are harvested at different times. Persistence of DNA in muscle following injection
may be determined by Southern blot analysis after preparing total cellular DNA and Hirt
supernatants from injected and control mice.

The results of the above experimentation in mice can be used to extrapolate proper
dosages and other treatment parameters in humans and other animals using naked DNA. 

Example 13: Transgenic Animals 

The polypeptides of the invention can also be expressed in transgenic animals.
Animals of any species, including, but not limited to, mice, rats, rabbits, hamsters, guinea
pigs, pigs, micro-pigs, goats, sheep, cows and non-human primates, e.g., baboons,
monkeys, and chimpanzees may be used to generate transgenic animals. In a specific
embodiment, techniques described herein or otherwise known in the art are used to
express polypeptides of the invention in humans, as part of a gene therapy protocol.

Any technique known in the art may be used to introduce the transgene (i.e.,
polynucleotides of the invention) into animals to produce the founder lines of transgenic
animals. Such techniques include, but are not limited to, pronuclear microinjection
(Paterson et al., Appl. Microbiol. Biotechnol. 40: 691-698 (1994); Carver et al.,
Biotechnology 11: 1263-1270 (1993); Wright et al., Biotechnology 9: 830-834 (1991); and
U.S. Pat. No. 4,873,191, the contents of which are hereby incorporated by reference herein
in their entirety); retrovirus mediated gene transfer into germ lines (Van der Putten et al.,
Proc. Natl. Acad. Sci. USA 82: 6148-6152 (1985)), blastocysts or embryos; gene targeting
in embryonic stem cells (Thompson et al., Cell 56: 313-321 (1989)); electroporation of
cells or embryos (Lo, Mol. Cell. Biol. 3: 1803-1814 (1983)); introduction of the
polynucleotides of the invention using a gene gun (see, e.g., Ulmer et al., Science 259:
1745 (1993)); introducing nucleic acid constructs into embryonic pluripotent stem cells
and transferring the stem cells back into the blastocyst; and sperm mediated gene transfer
(Lavitrano et al., Cell 57: 717-723 (1989)). For a review of such techniques, see Gordon,
"Transgenic Animals," Intl. Rev. Cytol. 115: 171-229 (1989).

Any technique known in the art may be used to produce transgenic clones 
containing polynucleotides of the invention, for example, nuclear transfer into enucleated 
oocytes of nuclei from cultured embryonic, fetal, or adult cells induced to quiescence
(Campbell et al., Nature 380: 64-66 (1996); Wilmut et al., Nature 385: 810-813 (1997)).

The present invention provides for transgenic animals that carry the transgene in
all their cells, as well as animals which carry the transgene in some, but not all their cells,
i.e., mosaic or chimeric animals. The transgene may be integrated as a single transgene
or as multiple copies such as in concatamers, e.g., head-to-head tandems or head-to-tail
tandems. The transgene may also be selectively introduced into and activated in a
particular cell type by following, for example, the teaching of Lasko et al. (Lasko et al.,
Proc. Natl. Acad. Sci. USA 89: 6232-6236 (1992)). The regulatory sequences required for
such a cell-type specific activation will depend upon the particular cell type of interest,
and will be apparent to those of skill in the art. When it is desired that the polynucleotide
transgene be integrated into the chromosomal site of the endogenous gene, gene targeting
is preferred. Briefly, when such a technique is to be utilized, vectors containing some
nucleotide sequences homologous to the endogenous gene are designed for the purpose of
integrating, via homologous recombination with chromosomal sequences, into and
disrupting the function of the nucleotide sequence of the endogenous gene. The transgene
may also be selectively introduced into a particular cell type, thus inactivating the
endogenous gene in only that cell type, by following, for example, the teaching of Gu et
al. (Gu et al., Science 265: 103-106 (1994)). The regulatory sequences required for such a
cell-type specific inactivation will depend upon the particular cell type of interest, and will
be apparent to those of skill in the art.

Once transgenic animals have been generated, the expression of the recombinant 
gene may be assayed utilizing standard techniques. Initial screening may be accomplished 
by Southern blot analysis or PCR techniques to analyze animal tissues to verify that
integration of the transgene has taken place. The level of mRNA expression of the
transgene in the tissues of the transgenic animals may also be assessed using techniques 
which include, but are not limited to, Northern blot analysis of tissue samples obtained 
from the animal, in situ hybridization analysis, and reverse transcriptase-PCR (RT-PCR).
Samples of transgenic gene-expressing tissue may also be evaluated
immunocytochemically or immunohistochemically using antibodies specific for the
transgene product. 

Once the founder animals are produced, they may be bred, inbred, outbred, or
crossbred to produce colonies of the particular animal. Examples of such breeding
strategies include, but are not limited to: outbreeding of founder animals with more than
one integration site in order to establish separate lines; inbreeding of separate lines in
order to produce compound transgenics that express the transgene at higher levels because
of the effects of additive expression of each transgene; crossing of heterozygous
transgenic animals to produce animals homozygous for a given integration site in order to
both augment expression and eliminate the need for screening of animals by DNA
analysis; crossing of separate homozygous lines to produce compound heterozygous or
homozygous lines; and breeding to place the transgene on a distinct background that is
appropriate for an experimental model of interest.
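The expected outcome of crossing heterozygous transgenic animals can be sketched by enumerating the gametes of each parent. The example assumes a single integration site that segregates in simple Mendelian fashion, which is an idealization; the function name `offspring_ratios` and the allele labels `Tg` (transgene present) and `+` (no transgene) are hypothetical.

```python
from collections import Counter
from fractions import Fraction
from itertools import product
from typing import Dict, Tuple


def offspring_ratios(
    parent_a: Tuple[str, str], parent_b: Tuple[str, str]
) -> Dict[str, Fraction]:
    """Expected genotype fractions from a cross, one allele drawn from each parent."""
    counts = Counter(
        # Sort so that "Tg" is written before "+" regardless of parental origin.
        "/".join(sorted(pair, reverse=True))
        for pair in product(parent_a, parent_b)
    )
    total = sum(counts.values())
    return {genotype: Fraction(n, total) for genotype, n in counts.items()}


if __name__ == "__main__":
    heterozygote = ("Tg", "+")
    for genotype, fraction in offspring_ratios(heterozygote, heterozygote).items():
        print(genotype, fraction)
```

Under these assumptions about one quarter of the offspring of a heterozygote intercross are homozygous for the integration site; once a homozygous line is established, every offspring of a homozygote intercross carries the transgene, which is why screening by DNA analysis is no longer needed.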

Transgenic animals of the invention have uses which include, but are not limited
to, animal model systems useful in elaborating the biological function of polypeptides of
the present invention, studying conditions and/or disorders associated with aberrant 
expression, and in screening for compounds effective in ameliorating such conditions 
and/or disorders. 

Example 14: Knock-Out Animals 

Endogenous gene expression can also be reduced by inactivating or "knocking out"
the gene and/or its promoter using targeted homologous recombination (e.g., see
Smithies et al., Nature 317: 230-234 (1985); Thomas & Capecchi, Cell 51: 503-512 (1987);
Thompson et al., Cell 56: 313-321 (1989)). Alternatively, RNAi technology may be used.
For example, a mutant, non-functional polynucleotide of the invention (or a completely
unrelated DNA sequence) flanked by DNA homologous to the endogenous polynucleotide
sequence (either the coding regions or regulatory regions of the gene) can be used, with or
without a selectable marker and/or a negative selectable marker, to transfect cells that
express polypeptides of the invention in vivo. In another embodiment, techniques known
in the art are used to generate knockouts in cells that contain, but do not express, the gene
of interest. Insertion of the DNA construct, via targeted homologous recombination,
results in inactivation of the targeted gene. Such approaches are particularly suited to
research and agricultural fields where modifications to embryonic stem cells can be used
to generate animal offspring with an inactive targeted gene (e.g., see Thomas & Capecchi
1987 and Thompson 1989, supra). However, this approach can be routinely adapted for
use in humans provided the recombinant DNA constructs are directly administered or
targeted to the required site in vivo using appropriate viral vectors that will be apparent to
those of skill in the art.
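
The targeting logic described above (a disrupting sequence flanked by DNA homologous to the endogenous locus) can be pictured with a toy string model. The sketch below is illustrative only and is not part of the disclosed method: the function name `targeted_insertion`, the example locus, and the arm sequences are all hypothetical, and real homologous recombination is a cellular process that this string replacement does not simulate.

```python
def targeted_insertion(genome: str, left_arm: str, right_arm: str, insert: str) -> str:
    """Toy model: replace the segment lying between two homology arms with `insert`.

    The arms stand in for the flanking homologous DNA; `insert` stands in for the
    mutant or unrelated sequence (optionally a selectable marker).
    """
    left_start = genome.find(left_arm)
    if left_start == -1:
        raise ValueError("left homology arm not found in target sequence")
    segment_start = left_start + len(left_arm)
    # The right arm must lie downstream of the left arm.
    segment_end = genome.find(right_arm, segment_start)
    if segment_end == -1:
        raise ValueError("right homology arm not found downstream of left arm")
    # Keep both arms, swap only the segment between them.
    return genome[:segment_start] + insert + genome[segment_end:]


if __name__ == "__main__":
    # Hypothetical locus: upstream + left arm + target segment + right arm + downstream.
    locus = "ttt" + "aaaccc" + "atggcc" + "gggttt" + "aaa"
    print(targeted_insertion(locus, "aaaccc", "gggttt", "NNNN"))
```

In this model the arms are preserved and only the segment between them is exchanged, which mirrors why the construct in the text is described as flanked by homologous DNA.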

In further embodiments of the invention, cells that are genetically engineered to
express the polypeptides of the invention, or alternatively, that are genetically engineered
not to express the polypeptides of the invention (e.g., knockouts) are administered to a
patient in vivo. Such cells may be obtained from the patient (i.e., animal, including
human) or an MHC compatible donor and can include, but are not limited to, fibroblasts,
bone marrow cells, blood cells (e.g., lymphocytes), adipocytes, muscle cells, endothelial
cells, etc. The cells are genetically engineered in vitro using recombinant DNA techniques
to introduce the coding sequence of polypeptides of the invention into the cells, or
alternatively, to disrupt the coding sequence and/or endogenous regulatory sequence
associated with the polypeptides of the invention, e.g., by transduction (using viral
vectors, and preferably vectors that integrate the transgene into the cell genome) or
transfection procedures, including, but not limited to, the use of plasmids, cosmids, YACs,
naked DNA, electroporation, liposomes, etc.

The coding sequence of the polypeptides of the invention can be placed under the
control of a strong constitutive or inducible promoter or promoter/enhancer to achieve
expression, and preferably secretion, of the polypeptides of the invention. The engineered
cells which express and preferably secrete the polypeptides of the invention can be
introduced into the patient systemically, e.g., in the circulation, or intraperitoneally.
Alternatively, the cells can be incorporated into a matrix and implanted in the
body, e.g., genetically engineered fibroblasts can be implanted as part of a skin graft;
genetically engineered endothelial cells can be implanted as part of a lymphatic or
vascular graft (see, for example, Anderson et al., U.S. Patent No. 5,399,349; and
Mulligan & Wilson, U.S. Patent No. 5,460,959, the contents of which are hereby
incorporated by reference herein in their entirety).

When the cells to be administered are non-autologous or non-MHC compatible
cells, they can be administered using well known techniques which prevent the
development of a host immune response against the introduced cells. For example, the
cells may be introduced in an encapsulated form which, while allowing for an exchange of
components with the immediate extracellular environment, does not allow the introduced
cells to be recognized by the host immune system.

Transgenic and "knock-out" animals of the invention have uses which include, but
are not limited to, animal model systems useful in elaborating the biological function of
polypeptides of the present invention, studying conditions and/or disorders associated with
aberrant expression, and in screening for compounds effective in ameliorating such
conditions and/or disorders.

While preferred illustrative embodiments of the present invention are described,
one skilled in the art will appreciate that the present invention can be practiced by other
than the described embodiments, which are presented for purposes of illustration only and
not by way of limitation. The present invention is limited only by the claims that follow.

We claim:

1. An isolated nucleic acid molecule comprising:

(a) a nucleic acid molecule comprising a nucleic acid sequence that encodes an
amino acid sequence of SEQ ID NO: 113-259;

(b) a nucleic acid molecule comprising a nucleic acid sequence of SEQ ID NO:
1-112;

(c) a nucleic acid molecule that selectively hybridizes to the nucleic acid
molecule of (a) or (b); or

(d) a nucleic acid molecule having at least 95% sequence identity to the nucleic
acid molecule of (a) or (b).

2. The nucleic acid molecule according to claim 1, wherein the nucleic acid molecule 
is a cDNA. 

3. The nucleic acid molecule according to claim 1, wherein the nucleic acid molecule
is genomic DNA.

4. The nucleic acid molecule according to claim 1, wherein the nucleic acid molecule 
is an RNA. 

5. The nucleic acid molecule according to claim 1, wherein the nucleic acid molecule 
is a mammalian nucleic acid molecule. 

6. The nucleic acid molecule according to claim 5, wherein the nucleic acid molecule
is a human nucleic acid molecule.

7. A method for determining the presence of a colon specific nucleic acid (CSNA) in
a sample, comprising the steps of:

(a) contacting the sample with the nucleic acid molecule of SEQ ID NO: 1-112
under conditions in which the nucleic acid molecule will selectively hybridize to a
colon specific nucleic acid; and

(b) detecting hybridization of the nucleic acid molecule to a CSNA in the
sample, wherein the detection of the hybridization indicates the presence of a
CSNA in the sample.



8. A vector comprising the nucleic acid molecule of claim 1.

9. A host cell comprising the vector according to claim 8.

10. A method for producing a polypeptide encoded by the nucleic acid molecule
according to claim 1, comprising the steps of:

(a) providing a host cell comprising the nucleic acid molecule operably linked 
to one or more expression control sequences, and 

(b) incubating the host cell under conditions in which the polypeptide is 
produced. 

11. A polypeptide encoded by the nucleic acid molecule according to claim 1.

12. An isolated polypeptide selected from the group consisting of:

(a) a polypeptide comprising an amino acid sequence with at least 95%
sequence identity to SEQ ID NO: 113-259; or

(b) a polypeptide comprising an amino acid sequence encoded by a nucleic
acid molecule having at least 95% sequence identity to a nucleic acid molecule
comprising a nucleic acid sequence of SEQ ID NO: 1-112.

13. An antibody or fragment thereof that specifically binds to:

(a) a polypeptide comprising an amino acid sequence with at least 95%
sequence identity to SEQ ID NO: 113-259; or

(b) a polypeptide comprising an amino acid sequence encoded by a nucleic
acid molecule having at least 95% sequence identity to a nucleic acid molecule
comprising a nucleic acid sequence of SEQ ID NO: 1-112.



14. A method for determining the presence of a colon specific protein in a sample,
comprising the steps of:

(a) contacting the sample with a suitable reagent under conditions in which the
reagent will selectively interact with the colon specific protein comprising an
amino acid sequence with at least 95% sequence identity to SEQ ID NO: 113-259; and

(b) detecting the interaction of the reagent with a colon specific protein in the
sample, wherein the detection of binding indicates the presence of a colon specific
protein in the sample.

15. A method for diagnosing or monitoring the presence and metastases of colon
cancer in a patient, comprising the steps of:

(a) determining an amount of:

(i) a nucleic acid molecule comprising a nucleic acid sequence that
encodes an amino acid sequence of SEQ ID NO: 113-259;

(ii) a nucleic acid molecule comprising a nucleic acid sequence of SEQ
ID NO: 1-112;

(iii) a nucleic acid molecule that selectively hybridizes to the nucleic
acid molecule of (i) or (ii);

(iv) a nucleic acid molecule having at least 95% sequence identity to the
nucleic acid molecule of (i) or (ii);

(v) a polypeptide comprising an amino acid sequence with at least 95%
sequence identity to SEQ ID NO: 113-259; or

(vi) a polypeptide comprising an amino acid sequence encoded by a
nucleic acid molecule having at least 95% sequence identity to a nucleic
acid molecule comprising a nucleic acid sequence of SEQ ID NO: 1-112;
and

(b) comparing the amount of the determined nucleic acid molecule or the
polypeptide in the sample of the patient to the amount of the colon specific marker
in a normal control; wherein a difference in the amount of the nucleic acid
molecule or the polypeptide in the sample compared to the amount of the nucleic
acid molecule or the polypeptide in the normal control is associated with the
presence of colon cancer.

16. A kit for detecting a risk of cancer or presence of cancer in a patient, said kit
comprising a means for determining the presence of:

(a) a nucleic acid molecule comprising a nucleic acid sequence that encodes an
amino acid sequence of SEQ ID NO: 113-259;

(b) a nucleic acid molecule comprising a nucleic acid sequence of SEQ ID NO:
1-112;

(c) a nucleic acid molecule that selectively hybridizes to the nucleic acid
molecule of (a) or (b); or

(d) a nucleic acid molecule having at least 95% sequence identity to the nucleic
acid molecule of (a) or (b); or

(e) a polypeptide comprising an amino acid sequence with at least 95%
sequence identity to SEQ ID NO: 113-259; or

(f) a polypeptide comprising an amino acid sequence encoded by a nucleic
acid molecule having at least 95% sequence identity to a nucleic acid molecule
comprising a nucleic acid sequence of SEQ ID NO: 1-112.

17. A method of treating a patient with colon cancer, comprising the step of
administering a composition consisting of:

(a) a nucleic acid molecule comprising a nucleic acid sequence that encodes an
amino acid sequence of SEQ ID NO: 113-259;

(b) a nucleic acid molecule comprising a nucleic acid sequence of SEQ ID NO:
1-112;

(c) a nucleic acid molecule that selectively hybridizes to the nucleic acid
molecule of (a) or (b);

(d) a nucleic acid molecule having at least 95% sequence identity to the nucleic
acid molecule of (a) or (b);

(e) a polypeptide comprising an amino acid sequence with at least 95%
sequence identity to SEQ ID NO: 113-259; or

(f) a polypeptide comprising an amino acid sequence encoded by a nucleic
acid molecule having at least 95% sequence identity to a nucleic acid molecule
comprising a nucleic acid sequence of SEQ ID NO: 1-112;

to a patient in need thereof, wherein said administration induces an immune response
against the colon cancer cell expressing the nucleic acid molecule or polypeptide.

18. A vaccine comprising the polypeptide or the nucleic acid encoding the polypeptide
of claim 12.
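
Several claims above recite "at least 95% sequence identity". As a hedged illustration of what such a threshold means numerically, the sketch below computes percent identity over two sequences that have already been aligned. It assumes one common convention (identical columns divided by alignment columns, counting gap columns); the claims do not specify the convention or the alignment program, and the function names and example strings are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity of two pre-aligned sequences ('-' marks a gap).

    Convention assumed here: matches / alignment columns, where columns that
    are gaps in both sequences are ignored. Other conventions exist.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(a, b) for a, b in zip(aligned_a.lower(), aligned_b.lower())
               if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("no alignment columns to compare")
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def meets_identity_threshold(aligned_a: str, aligned_b: str, threshold: float = 95.0) -> bool:
    """True when the pair reaches the stated percent-identity threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    reference = "acgtacgtacgtacgtacgt"   # hypothetical 20-base reference
    candidate = "acgtacgtacgtacgtacga"   # one mismatch in 20 columns
    print(percent_identity(reference, candidate))
    print(meets_identity_threshold(reference, candidate))
```

Because the result depends on how gaps and alignment length are counted, any real comparison would need to state the alignment method and identity definition explicitly.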



<130> DEX-0449
<151> 2002-12-04

<150> US 60/431,206 
<151> 2002-12-04 

<160> 259 

<170> PatentIn version 3.1

<210> 1 

<211> 1101 

<212> DNA 

<213> Homo sapiens

<400> 1 

gggsgccggr rccgcgccgg gccggctcca gggcggccgc ggcggtgggg cgcaggccgc 60

cgcgcgagtg aatcgaggcg gcgggcccat ccggaaccgg ccggccatcg cccgcggcgc 120 

ggccggcgga ggcggcagga accgaccggc gccctacagc aggccaaaac aacttcccga 180 

caagtggcag cacgatcttt tcgacagtgg cttcggcggt ggtgccggcg tggagacagg 240 

tgggaaactg ctggtgtcca atctggattt tggagtctca gacgccgata ttcaggaact 300 

ctttgctgaa tttggaacgc tgaagaaggc ggctgtgcac tatgatcgct ctggtcgcag 360

cttaggaaca gcagacgtgc actttgagcg gaaggcagat gccctgaagg ccatgaagca 420 

gtacaacggc gtccctctgg atgcaagcta catccctccc ctccttcagc tccttccgga 480 

agattcactt ctgtgaagcc ttctcagaac cgccatagta cacctctgag tgcggggctg 540 

tagtcgtaaa gacaaaggcc gccccatgaa cattcagctt gtcacgtcac agattgacgc 600 

acagcggagg cctgcacaga gcgtaaacag aggtggcatg actagaaacc gtggcgctgg 660 

aggttttggt ggtggtggag gcacccggag aggcacccgc ggaggcgccc gtggaagagg 720

cagaggtgcc ggcaggaatt caaagcagca gctttcggca gaggagctgg atgcccagct 780 

ggacgcctat aatgcgagaa tggacaccag ttaaacagac cagcaaatcc gcgtgcggaa 840 

caggacccag gcgtctcctc ttgctccctg gttggggggc ggtggctggg gctgtgcggc 900 

caatgatgga tttgtttctt ttatgtttta aaataggatt taaaaactca tgtaaaggtt 960 

tttttttttt cttttttttt ttttttaatt ctgaaacaga cctgttttgt accgagttat 1020

ttttgggata aattttactg gttgctgttg tggagaaggt ggcgtttcca ccttttccat 1080

aataaaatag aaatgtgtgt a 1101
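
Each sequence row in this listing carries six ten-base blocks followed by a running base count, and the `<211>` field gives the declared total length. The sketch below shows one way such rows could be reassembled and checked; it is an illustrative helper, not part of the filing. The function names are hypothetical, and the accepted alphabet is the lower-case IUPAC nucleotide code set (which covers ambiguity symbols such as `s` and `r` seen in SEQ ID NO: 1).

```python
# Lower-case IUPAC nucleotide codes, including ambiguity symbols.
IUPAC_NUCLEOTIDES = set("acgturykmswbdhvn")


def parse_row(line: str):
    """Split one listing row into (bases, running_count or None)."""
    tokens = line.split()
    count = None
    # A trailing all-digit token is treated as the cumulative base count.
    if tokens and tokens[-1].isdigit():
        count = int(tokens[-1])
        tokens = tokens[:-1]
    bases = "".join(tokens)
    unexpected = set(bases) - IUPAC_NUCLEOTIDES
    if unexpected:
        raise ValueError(f"non-IUPAC characters in row: {sorted(unexpected)}")
    return bases, count


def assemble(rows, declared_length: int) -> str:
    """Join rows into one sequence, checking running counts and total length."""
    sequence = ""
    for line in rows:
        bases, count = parse_row(line)
        sequence += bases
        if count is not None and count != len(sequence):
            raise ValueError(f"running count {count} != {len(sequence)} bases read")
    if len(sequence) != declared_length:
        raise ValueError(f"declared length {declared_length}, found {len(sequence)}")
    return sequence


if __name__ == "__main__":
    # First row of SEQ ID NO: 1 plus a shortened, hypothetical final row.
    rows = [
        "gggsgccggr rccgcgccgg gccggctcca gggcggccgc ggcggtgggg cgcaggccgc 60",
        "aataaaatag aaatgtgtgt a 81",
    ]
    print(len(assemble(rows, declared_length=81)))
```

A check of this kind also flags extraction damage: a split counter such as `6 0` leaves a stray digit among the bases, which fails the alphabet test instead of passing silently.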

<210> 2 
<211> 1022 
<212> DNA 
<213> Homo sapiens

<400> 2 

cgaacgcctt cgcgcgatcg ccctggaaac gcattctctg cgaccggcag ccgccaatgg 60 

gaagggagtg agtgccacga acaggccaat aaggagggag cagtgcgggg tttaaatctg 120 

aggctaggct ggctcttctc ggcgtgctgc ggcggaacgg ctgttggttt ctgctggttg 180 

taggtccttg gctggtcggg cctccggtgt tctgcttctc cccgctgagc tgctgcctgg 240 

tgaagaggaa gccatggcgc tccgagtcac caggaactcg aaaattaatg ctgaaaataa 300 

ggcgaagatc aacatggcag gcgcaaagcg cgttcctacg gcccctgctg caacctccaa 360

gcccggactg aggccaagaa cagctcttgg ggacattggt aacaaagtca gtgaacaact 420

gcaggccaaa atgcctatga agaaggaagc aaaaccttca gctactggaa aagtcattga 480 

taaaaaacta ccaaaacctc ttgaaaaggt acctatgctg gtgccagtgc cagtgtctga 540

gccagtgcca gagccagaac ctgagccaga acctgagcct gttaaagaag aaaaactttc 600 

gcctgagcct attttggttg atactgcctc tccaagccca atggaaacat ctggatgtgc 660 

ccctgcagaa gaagacctgt gtcaggcttt ctctgatgta attcttgcag taaatgatgt 720 

ggatgcagaa gatggagctg atccaaacct ttgtagtgaa tatgtgaaag atatttatgc 780 

ttatctgaga caacttgagg aagagcaagc agtcagacca aaatacctac tgggtcggga 840

agtcactgga aacatgagag ccatcctaat tgactggtag tacaggttca aatgaaatcc 900 

gtgtgtgcgg gacctgtctg cggtcctatt gacgggccgg caaaattggg tgcccaaatt 960 

gctggggggc cggccgtttg gcccttgaaa ggccccaggg ggcgttgggg gaccttagca 1020

ga 1022



<210> 3 

<211> 6046 

<212> DNA 

<213> Homo sapiens

<400> 3 

acagtgtcat ggaggccgcg gcggtgacgg tcacccggtc ggctacacgg cggcggcggc 60 

ggcagctgca ggggctggcg gccccggagg cggggacgca ggaggagcag gaggatcagg 120 

agccgcggcc gcggcggcgg cggccgggaa ggagcatcaa agatgaagaa gaagagacag 180
tctttcgaga ggtggtcagt ttttccccgg accccctgcc agttagatat tatgacaagg 240

acaccaccaa accaatcagc ttttacttgt cttcgctgga ggagctcttg gcgtggaagc 300 

cccgcttgga ggatggcttt aatgtggccc tggagcccct ggcgtgtcgc cagccccctc 360 

tgagcagcca gaggccccgg actttgttgt gtcatgacat gatgggcggg tacctggatg 420 

acaggttcat tcagggctcg gtggtgcaga ctccctatgc tttctaccac tggcagtgca 480 

tcgacgtctt tgtgtacttc agccaccaca ccgtcaccat tcccccagtg ggctggacca 540 

acactgccca caggcatggg gtctgcgtgc tggggacttt catcacggag tggaatgaag 600 

ggggaaggct ctgtgaagcc ttcctggccg gggatgagcg ctcgtaccag gcagtggctg 660 

accggctggt ccagatcact cagttttttc gttttgatgg ctggctgatc aacatcgaga 720 

actcgctgag tctggccgct gtggggaaca tgcctccttt cctgcggtac ctcaccacac 780 

agctgcaccg gcaggtccca gggggcctgg tgctctggta tgacagcgtg gtgcaaagtg 840 

ggcagctcaa atggcaagac gaactcaacc agcacaacag ggtcttcttt gattcctgcg 900 

acggcttctt cactaactat aactggcggg aggagcactt ggagcggatg ctggggcagg 960 

ctggggagcg ccgggctgat gtgtacgtgg gcgtggatgt gtttgctcga gggaacgtgg 1020

tcggaggccg attcgacaca gacaaggtgg gtggtggctt tcgtccaagg gccagcggcc 1080 

cagtgcctcc cctgggacct catttcctca tggaccttcc attcccgtct gcacctcaaa 1140

ggaatgacag cagctgttct tcccagagtg gggatcccgt ggcactgaga aacaggtgtc 1200 

ctgccccggc caagctatgt ccccactgaa acccctttcc cggctatttc tacaatcaga 1260 

taatgtattt ttgtgtgggt gttaccagac agaggcgcgt cgttcccccg ctgtccggtc 1320

ctcggcgggc ctgggcctcc tgtcccagct gcgttccttg atcttccccg tgagggaagc 1380 

ctgcggggac gcacaggggc ctgcagcgcc tctcactgcg gctcaggggc gaggacgggg 1440 

gaggtgccca gggtttgggg atcggtaggt gtcatgacga gggagcattt tctggcaagg 1500 

caggtcacac cagcacagag gaaggagcgg ggccaggccg gtgcccctgg cgtgggatgc 1560 

gcagggctct gcctgcctct ggtctatcag tcctgcccgt tcccagagcc ccagggatgt 1620 

tctcagccct taatttagct ttaaatgagt aaattaatcc aataactaaa gagcctcttt 1680 

actgcccgtg attctgataa tcgaatgaga ggtgtgccag aggacaatgg gaccgcgcag 1740 

ccacagccag ttggggccca gcccctgccc ccctgccctt gcagcagcgc tgaccgcctc 1800 

tcccgcgccc tcttctggcg ggcttgtgga acgacggggg tgatgcccag cagcctccag 1860 

gtgcatctag gagggagggg ttggacagct tccccctccc gtttctgagc tgccctgaga 1920 

gccccaaccc acgtcactgg cgtccaggag cctgtctgga ctctggtcct gatgtgccca 1980 

gcgtctccct ctgtctcttc agtcgttgga gctgatccga aagcatggct tctccgtggc 2040

tttgtttgcc cccggctggg tgtatgagtg tctggagaag aaggatttct tccagaacca 2100

ggacaagttc tggggccgac tggagcgtta tctgcccaca catagcatct gctccttgcc 2160

tttcgtcacg tccttctgcc tgggcatggg tgcacggagg gtctgctatg gccaggaaga 2220

ggcggtaggg ccctggtacc acctgagcgc ccaggagatc cagcccttgt ttggagaaca 2280

caggctggga ggggatggcc ggggctgggt gaggacgcac tgctgcctgg aggatgcctg 2340

gcacggaggc agctccctgc tcgtccgggg tgtgatccca ccggaggttg gaaatgtggc 2400

tgtgaggtgg gtgagtgacg gaggacggtg ggcccaccag cttctcccat cacacgtggt 2460

ggccatggaa ctggacagat ggggcagtgg aggccagaac aaggacagag gacagaccca 2520

gatgggtttt ttaaaattgt gatcaaaaat gcattaacgt gaaattcacc atcttgaccc 2580

tgttgaagtg aacggtacag gagtgttaac tctgcgtacg ttgctacgtg acagatccct 2640

agaactgtgt catcttgtag gactggactc tccccattgc acattagccc cccatcctcc 2700

cccagccccc caaagccacc cctttactct gcatctgtga gtttggatac tttagatgcc 2760

ccatatgagg gaatcccata cggtatttca gaactgagat gtttcagccc acaaaagaat 2820

ctggccttgg gggaggactt gtcccctggc tacggacagg accacgatgt gggagcattt 2880

ggcacccagg ccccctgttc ctggagggag gggctggtgg acgcgatttg gagctggctt 2940

cgatttctca gtggtctctc cacggcaccc atcacaggac ctttttcccc aggttatttt 3000

tattccctgc aggccccagt gccacccaag atttacctgt ccatggtgta taagcttgag 3060

gggcccacgg acgtcacagt tgctttggag ctgaccacag gggatgccgg cagctgccac 3120

atcggtggca tctcagtgtt gaacgcagaa acaagctcaa gacacagcct ccgacccctc 3180

cgggtgcccc ccaccaagct ggccagatgg gtgggccgct gcggccggca gctgagtggg 3240

ggctgggtcc agcactgcta cgaggtgagc ctgcgtgggt gcctgctgct agacctcctc 3300

gtttgcttct cacggccgcc gggtagtcgg gaggaggaga gcttcacctg tcggcttgga 3360

gagatccagg tggtggacgc tgccagcctg ctggcccctc tgccccaggt gcaggccgtc 3420

accatctctc acatccgctg gcagccatcc gcctctgagc gggaggggcc ccctgctctg 3480

ctccagctca gctgcaccct gcactggtcc ttcctcctct cacaagtccg ttgcttccga 3540

atccactgct ggggagggat gagtgatgac tctccgggca gggagctgcc gaggccagag 3600

atgcccatgt tcctggggtt ggcttttgcc acccagtacc ggatagtgga cctgctggtg 3660

gaagccgccg ggcccggcca ggatcgtcgc atggaatttc tggtggagcc tgtccccaag 3720

gaagggttcc gggtacctca ggccgagtgg ggcagggcag ttctgcttta ttcagcccct 3780

gcatgagcgg atgctaaggc cgggtggtct cctggcctcg ggctgaggcc tcttcccggc 3840

aaggatatgg ggcaggaggc 5460

tccctctgct gaccccctgc agcctggagc cagcccgggg actgtcctgg gtggagggca 5520

ggtgaacaca agctgctgcc ggggaccgtc c ^gggtggac ggcaggtgaa cacaagcggc 5580

tgccgcatgt agccactcac tcgacttttt ttcagctgtg accattcctg ggagctcttt 5640

gagcctttct gtctcatttg gaaccagggg gaaccaggaa ggggcccctg gcctctctgt 5700

gtcctctgca gtgggggttg tggggggcgc agcatccacg ccttgctgcc cttctttcat 5760

gaagtctgtt ttttaagtgc tggttccccc gaatatttta tgcagaggag ggaaaattta 5820

tagtggcaat tattttctca cagtctggtg agcaggcaat taattaggag taagggggcc 5880

tagtagagcg tggcgtgtgg cagaatcgca ccgccccggc tccccagccc accgccatgc 5940

agggctcgcg tgcgggaaaa ctaatatgcc ggcgtttaag cctgtgcccc tctgctgggt 6000

gtaactgcgc tgaaataaat gatctgacaa tgtgaaaaaa






o u *± o 


<210> 4 

<211> 5839 

<212> DNA 

<213> Homo sapiens








<400> 4

acagtgtcat ggaggccgcg gcggtgacgg tcacccggtc ggctacacgg cggcggcggc 60

ggcagctgca ggggctggcg gccccggagg cggggacgca ggaggagcag gaggatcagg 120

agccgcggcc gcggcggcgg cggccgggaa ggagcatcaa agatgaagaa gaagagacag 180

tctttcgaga ggtggtcagt ttttccccgg accccctgcc agttagatat tatgacaagg 240

acaccaccaa accaatcagc ttttacttgt cttcgctgga ggagctcttg gcgtggaagc 300

cccgcttgga ggatggcttt aatgtggccc tggagcccct ggcgtgtcgc cagccccctc 360

tgagcagcca gaggccccgg actttgttgt gtcatgacat gatgggcggg tacctggatg 420

acaggttcat tcagggctcg gtggtgcaga ctccctatgc tttctaccac tggcagtgca 480

tcgacgtctt tgtgtacttc agccaccaca ccgtcaccat tcccccagtg ggctggacca 540

acactgccca caggcatggg gtctgcgtgc tggggacttt catcacggag tggaatgaag 600

ggggaaggct ctgtgaagcc ttcctggccg gggatgagcg ctcgtaccag gcagtggctg 660

accggctggt ccagatcact cagttttttc gttttgatgg ctggctgatc aacatcgaga 720

actcgctgag tctggccgct gtggggaaca tgcctccttt cctgcggtac ctcaccacac 780

agctgcaccg gcaggtccca gggggcctgg tgctctggta tgacagcgtg gtgcaaagtg 840

ggcagctcaa atggcaagac gaactcaacc agcacaacag ggtcttcttt gattcctgcg 900

acggcttctt cactaactat aactggcggg aggagcactt ggagcggatg ctggggcagg 960

ctggggagcg ccgggctgat gtgtacgtgg gcgtggatgt gtttgctcga gggaacgtgg 1020

tcggaggccg attcgacaca gacaaggtgg gtggtggctt tcgtccaagg gccagcggcc 1080

cagtgcctcc cctgggacct catttcctca tggaccttcc attcccgtct gcacctcaaa 1140

ggaatgacag cagctgttct tcccagagtg gggatcccgt ggcactgaga aacaggtgtc 1200

ctgccccggc caagctatgt ccccactgaa acccctttcc cggctatttc tacaatcaga 1260



WO 2004/050900 



PCT/US2003/040131 




taatgtattt ttgtgtgggt gttaccagac agaggcgcgt cgttcccccg ctgtccggtc 1320 

ctcggcgggc ctgggcctcc tgtcccagct gcgttccttg atcttccccg tgagggaagc 1380 

ctgcggggac gcacaggggc ctgcagcgcc tctcactgcg gctcaggggc gaggacgggg 1440 

gaggtgccca gggtttgggg atcggtaggt gtcatgacga gggagcattt tctggcaagg 1500 

caggtcacac cagcacagag gaaggagcgg ggccaggccg gtgcccctgg cgtgggatgc 1560 

gcagggctct gcctgcctct ggtctatcag tcctgcccgt tcccagagcc tctttactgc 1620 

ccgtgattct gataatcgaa tgagaggtgt gccagaggac aatgggaccg cgcagccaca 1680 

gccagttggg gcccagcccc tgcccccctg cccttgcagc agcgctgacc gcctctcccg 1740 

cgccctcttc tggcgggctt gtggaacgac gggggtgatg cccagcagcc tccagtcgtt 1800 

ggagctgatc cgaaagcatg gcttctccgt ggctttgttt gcccccggct gggtgtatga 1860 

gtgtctggag aagaaggatt tcttccagaa ccaggacaag ttctggggcc gactggagcg 1920 

ttatctgccc acacatagca tctgctcctt gcctttcgtc acgtccttct gcctgggcat 1980 

gggtgcacgg agggtctgct atggccagga agaggcggta gggccctggt accacctgag 2040

cgcccaggag atccagccct tgtttggaga acacaggctg ggaggggatg gccggggctg 2100 

ggtgaggacg cactgctgcc tggaggatgc ctggcacgga ggcagctccc tgctcgtccg 2160 

gggtgtgatc ccaccggagg ttggaaatgt ggctgtgagg tgggtgagtg acggaggacg 2220 

gtgggcccac cagcttctcc catcacacgt ggtggccatg gaactggaca gatggggcag 2280 

tggaggccag aacaaggaca gaggacagac ccagatgggt tttttaaaat tgtgatcaaa 2340 

aatgcattaa cgtgaaattc accatcttga ccctgttgaa gtgaacggta caggagtgtt 2400 

aactctgcgt acgttgctac gtgacagatc cctagaactg tgtcatcttg taggactgga 2460 

ctctccccat tgcacattag ccccccatcc tcccccagcc ccccaaagcc acccctttac 2520 

tctgcatctg tgagtttgga tactttagat gccccatatg agggaatccc atacggtatt 2580 

tcagaactga gatgtttcag cccacaaaag aatctggcct tgggggagga cttgtcccct 2640 

ggctacggac aggaccacga tgtgggagca tttggcaccc aggccccctg ttcctggagg 2700 

gaggggctgg tggacgcgat ttggagctgg cttcgatttc tcagtggtct ctccacggca 2760 

cccatcacag gacctttttc cccaggttat ttttattccc tgcaggcccc agtgccaccc 2820 

aagatttacc tgtccatggt gtataagctt gaggggccca cggacgtcac agttgctttg 2880 

gagctgacca caggggatgc cggcagctgc cacatcggtg gcatctcagt gttgaacgca 2940 

gaaacaagct caagacacag cctccgaccc ctccgggtgc cccccaccaa gctggccaga 3000 

tgggtgggcc gctgcggccg gcagctgagt gggggctggg tccagcactg ctacgaggtg 3060 
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^•y *-v*uy v-y cy 










geegggcagu 


J 1^5 U 


c yyg a ggagg 


agagc u t. cac 


ccgccggccc 


ggagagaccc 


a g9t9gcgga 


cgccgccagc 


o ^ o r* 
3 1B0 


CtyCtygcCC 


CCCLyCCCCa 


ggcgcaggcc 


gccaccaccc 


cccacatccg 


ctggcagcca 


3240 




agegggaggg 


gccccctgct 


ccgctccagc 


ccagccgcac 


cctgcactgg 


*a o A A 
3300 




UCUCauaayL 


ccyc Ly t,t lc 


cgaacccacc 


gctggggagg 


gatgagtgat 


3360 




y Gctyyyciy t» L. 


gcuyagyecd. 


rrarrah rtr+i-> »*^aa 

yagaLyccca 


tgLLCCLggg 


gccggccntc 


3420 




ra r'^orra t" anh 
ci you cty t_ 


gyacctgety 


y egg cidy ccg 


ccgggcccgg 


ecaggategt 


^ j\ a t\ 
3480 




t. tutyy tyya 


yuwcytcccc 


a aggaa y y y t 


cccgggcacc 


teaggecgag 


3540 


cggggcaggg 


«a /Tt* 4* 4— /writ' 

cage tccgcc 


LtatCCayCC 


ectgeatgag 


eggatgetaa 


ggccgggtgg 


3600 


ucLcccggcc 


tegggctgag 


gcctcttccc 


ggctgtctgc 


ccctggcctg 


cgctggacct 


3660 


gctaagcgcc 


cacagtggca 


gcgaggt ccc 


ggtcccgggg 


ctggggtggg 


agaccccggg 


3720 


*—L-y ay uy Lty 


uy y l. LLLLy 




aeggaaacag 


gaaaccaagc 


^tgggatcg 


3780 


^»y eg t tyg i- 


cactgegagg 




gctttccgct 


tetgect tgt 


ccctccccac 


3 840 


frrtl - s /-i ft /""rr-ri- 


#-i /-i ft /t ^* ^ 
Lcccaggt.ga 


aaatzgaaagg 


a 9*g99 a 9 aa g 


tfcgagaacag 


aacattccat 


3900 


a a a rma haH" 
aaagyauaLL 


tccLaatagg 


ctgcaagatg 


ctgatgeega 


gaatgatgat 


tttctttcct 


3960 


yGaycl ugaclct 


ccatuagaaa 


gggtcttaga 


ttgtggcagg 


taggctttgg 


agcaggcgcc 


4020 


na rta fsa f~ fr* J - r*» 
yay duct llll. 


tgag cacgag 


gacgagctac 


agcagctcct 


ggggtggggc 


tgcctgcggg 


4080 


«* ^yy'-gyyay 


«yyci cgccct 


ggagaaccgn 


cct cc cagtg 


tggaaggece 


ttttccctga 


4140 


yy a y i -yyy t - a 


l. l. v_ ^yyy-ua 


gcuggcgctg 


gctccgLgcc 


tccacgtggg 


ccagccccag 


4200 




utucctggcg 


c uggcaacne 


actgtgctgc 


tgagtgtgag 


gtcatctccg 


4260 


y«y ^y 


»yc«gccccL 


ggcuc cgegg 


cgfcctcfcfccc 


gggctgtggg 


catgeaggga 


4320 


«y Lgyctctg 


aggcagt ccg 


cgctgtggcc 


ctgcctctgc 


c cagegagag 


gccgtgggct 


4380 


w L*yyav«ciciyL> 


/~t r^r ^ ♦» ^ m 

oycccLtcag 


gctggggtag 


caggtcagtc 


caggcaggaa 


gcagcacctg 


4440 




^ a {~tf-> f* 
^ayccCdgcc 


ccagccugag 


tgeaggaget 


gcaggacccg 


egggggcttt 


4500 




pf-nf t- r* t- 4- <-i 


acgccccccc 


*- *- _ 1_ — . /-^ ^ ^-i 4- 

uccccagcct 


cgtccaagca 


cegggaagae 


4560 




s ^ i"* /"^ ^ 4" «a #^«r 

accccttgag 


cagcagucag 


cacaggtgcg 


tgggggcgtg 


agggaggcag 


4620 


ggtcttcacc acaggcgcct tcctctgtcc ttcctgctct ttcttctctg cccaggccgc 4680

tgcagctgca cagcctctgc tacacctggg ctgcctggga ggcttcctgg tgtggtgtct 4740

ggaccccacg gccttgggtc atcctgtggc tggtctgggg tggggtctgt tgtggtcctt 4800

ccacggtgtc agtggcctga agtccctcgc ttttgggggg ggggtctctc acccccaggc 4860

cacatagggc cagtggtagg ggttccctct atgtcgggca gtgctgaggg ctgggatgct 4920

ctgtgacccc agctggagcc cacacctaag ggctggcatc cacatcattt caccctgcag 4980

tgagggaaga ggccaccagg tggcagcaca gccacacccg ttcccacgtc agaggagggc 5040 

aaggctgggt actcagcagc cactctgagc cggggctcct tccaggagct gaaatccacc 5100 

tgtctccatc ttccttgcct gcctgggtac tcatgccaag cagagactgg gattaggggt 5160 

tctgtgctct tgcctaatta ggaacattct cccatgtctc ttgtgtggtc ccagaaggag 5220 

aagtgagttt gccaaggata tggggcagga ggctccctct gctgaccccc tgcagcctgg 5280 

agccagcccg gggactgtcc tgggtggagg gcaggtgaac acaagctgct gccggggact 5340 

gtcctgggtg gacggcaggt gaacacaagc ggctgccgca tgtagccact cactcgactt 5400 

tttttcagct gtgaccattc ctgggagctc tttgagcctt tctgtctcat ttggaaccag 5460 

ggggaaccag gaaggggctc ctggcctctc tgtgtcctct gcagtggggg ttgtgggggg 5520

cgcagcatcc acgccttgct gcccttcttt catgaagtct gttttttaag tgctggttcc 5580 

cccgaatatt ttatgcagag gagggaaaat ttatagtggc aattattttc tcacagtctg 5640

gtgagcaggc aattaattag gagtaagggg gcctagtaga gcgtggcgtg tggcagaatc 5700 

gcaccgcccc ggctccccag cccaccgcca tgcagggctc gcgtgcggga aaactaatat 5760 

gccggcgttt aagcctgtgc ccctctgctg ggtgtaactg cgctgaaata aatgatctga 5820 

caatgtgaaa aaaaaaaaa 5839 

<210> 5 

<211> 5090 

<212> DNA 

<213> Homo sapiens

<400> 5 

acagtgtcat ggaggccgcg gcggtgacgg tcacccggtc ggctacacgg cggcggcggc 60 

ggcagctgca ggggctggcg gccccggagg cggggacgca ggaggagcag gaggatcagg 120 

agccgcggcc gcggcggcgg cggccgggaa ggagcatcaa agatgaagaa gaagagacag 180 

tctttcgaga ggtggtcagt ttttccccgg accccctgcc agttagatat tatgacaagg 240 

acaccaccaa accaatcagc ttttacttgt cttcgctgga ggagctcttg gcgtggaagc 300

cccgcttgga ggatggcttt aatgtggccc tggagcccct ggcgtgtcgc cagccccctc 360

tgagcagcca gaggccccgg actttgttgt gtcatgacat gatgggcggg tacctggatg 420

acaggttcat tcagggctcg gtggtgcaga ctccctatgc tttctaccac tggcagtgca 480 

tcgacgtctt tgtgtacttc agccaccaca ccgtcaccat tcccccagtg ggctggacca 540 

acactgccca caggcatggg gtctgcgtgc tggggacttt catcacggag tggaatgaag 600 

ggggaaggct ctgtgaagcc ttcctggccg gggatgagcg ctcgtaccag gcagtggctg 660

accggctggt ccagatcact cagttttttc gttttgatgg ctggctgatc aacatcgaga 720

actcgctgag tctggccgct gtggggaaca tgcctccttt cctgcggtac ctcaccacac 780

agctgcaccg gcaggtccca gggggcctgg tgctctggta tgacagcgtg gtgcaaagtg 840

ggcagctcaa atggcaagac gaactcaacc agcacaacag ggtcttcttt gattcctgcg 900 

acggcttctt cactaactat aactggcggg aggagcactt ggagcggatg ctggggcagg 960 

ctggggagcg ccgggctgat gtgtacgtgg gcgtggatgt gtttgctcga gggaacgtgg 1020 

tcggaggccg attcgacaca gacaagtcgt tggagctgat ccgaaagcat ggcttctccg 1080 

tggctttgtt tgcccccggc tgggtgtatg agtgtctgga gaagaaggat ttcttccaga 1140 

accaggacaa gttctggggc cgactggagc gttatctgcc cacacatagc atctgctcct 1200 

tgcctttcgt cacgtccttc tgcctgggca tgggtgcacg gagggtctgc tatggccagg 1260 

aagaggcggt agggccctgg taccacctga gcgcccagga gatccagccc ttgtttggag 1320

aacacaggct gggaggggat ggccggggct gggtgaggac gcactgctgc ctggaggatg 1380 

cctggcacgg aggcagctcc ctgctcgtcc ggggtgtgat cccaccggag gttggaaatg 1440 

tggctgtgag gtgggtgagt gacggaggac ggtgggccca ccagcttctc ccatcacacg 1500 

tggtggccat ggaactggac agatggggca gtggaggcca gaacaaggac agaggacaga 1560 

cccagatggg ttttttaaaa ttgtgatcaa aaatgcatta acgtgaaatt caccatcttg 1620 

accctgttga agtgaacggt acaggagtgt taactctgcg tacgttgcta cgtgacagat 1680 

ccctagaact gtgtcatctt gtaggactgg actctcccca ttgcacatta gccccccatc 1740 

ctcccccagc cccccaaagc caccccttta ctctgcatct gtgagtttgg atactttaga 1800 

tgccccatat gagggaatcc catacggtat ttcagaactg agatgtttca gcccacaaaa 1860 

gaatctggcc ttgggggagg acttgtcccc tggctacgga caggaccacg atgtgggagc 1920 

atttggcacc caggccccct gttcctggag ggaggggctg gtggacgcga tttggagctg 1980 

gcttcgattt ctcagtggtc tctccacggc acccatcaca ggaccttttt ccccaggtta 2040 

tttttattcc ctgcaggccc cagtgccacc caagatttac ctgtccatgg tgtataagct 2100 
tgaggggccc acggacgtca cagttgcttt ggagctgacc acaggggatg ccggcagctg 2160 
ccacatcggt ggcatctcag tgttgaacgc agaaacaagc tcaagacaca gcctccgacc 2220 
cctccgggtg ccccccacca agctggccag atgggtgggc cgctgcggcc ggcagctgag 2280 
tgggggctgg gtccagcact gctacgaggt gagcctgcgt gggtgcctgc tgctagacct 2340 
cctcgtttgc ttctcacggc cgccgggtag tcgggaggag gagagcttca cctgtcggct 2400 
tggagagatc caggtggtgg acgctgccag cctgctggcc cctctgcccc aggtgcaggc 2460 
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cgc caeca c c 


tctcacatcc 


gctggcagcc atccgcctct 


gagegggagg ggccccctgc 


2520 


cccgccccag 


ctcagctgca 


ccctgcactg gtccttcctc 


ctctcacaag 


teegttgett 




ccy aauc cac 


tgctggggag 


ggatgagtga tgactctccg ggcagggagc 


tgccgaggcc 




agagatgece 


atgttcctgg ggttggcttt tgccacccag 


taceggatag 


tggacctget 




ggtggaagcc 


gccgggcccg 


gccaggatcg tcgcatggaa


tttctggtgg 


agcctgtccc 


2760


caaggaaggg 


ttccgggtac


ctcaggccga gtggggcagg 


gcagttctgc 


tttattcagc 


2820


ccctgcatga 


gcggatgcta


aggccgggtg gtctcctggc


ctcgggctga ggcctcttcc


2880


cggctgtctg 


cccctggcct 


gcgctggacc tgctaagtgc 


ccacagtggc 


agcgaggtcc


2940


cggtcccggg gctggggtgg gagaccccgg gctgagtgct


gtggctttct 


ggtggggggc 


3000


gatggaaaca ggaaaccaag 


cagtgggatc gcagcgttgg 


tcactgcgag gcgagtggcg


3060


ggctttctgt 


ttctgccttg 


tccctcccca cggtacctgg


ttcccaggtg aaaatgaaag


3120


gaggggagaa 


gttgagaaca 


gaacattcca taaaggatat 


ttcctaatag 


gctgcaagat


3180


gctgatgccg


agaatgatga 


ttttctttcc tgcagatgaa 


actattagaa 


agggtcttag 


3240


attgtggcag 


gtaggctttg 


gagcaggcgc cgagacattt


ctgagcatga ggacgagcta 


3300


cagcagctcc 


tggggtgggg 


ctgcctgcgg gatggcggga 


gaggatgccc


tggagaaccg 


3360


tcctcccagt 


gtggaaggcc 


cttttccctg aggagtgggc 


attctgggcc


agccggcgct 


3420


ggcttcgtgc 


ctccacgtgg gccagcccca gctgctccgt 


gtttcctggc 


gttggcaatt 


3480


tactgtgctg 


ctgagtgtga 


ggtcatctcc ggagcgtttt 


cagcagcccc 


tggctctgcg 


3540


gcgtctcttc 


cgggctgtgg gcatgcaggg aagtggctct


gaggcagtct 


gcgctgtggc 


3600


cctgcctctg 


cccagcgaga ggccgtgggc tctggacaag ccgcccttca 


ggctggggta 


3660


gcaggtcagt 


ccaggcagga 


agcagcacct gccccccgcg 


ccagcccagc 


cccagcctga 


3720 


gtgcaggagc


tgcaggaccc 


gcgggggctt ttccagctac


tctgttcctt 


cacgtcctcc 


3780


cttctcagcc 


tcgtccaagc 


accgggaaga cctccaggct


gaccccttga 


gcagcagtca 


3840


gcacaggtgc gtgggggcgt 


gagggaggca gggtcttcac 


cacaggcgcc 


ttcctctgtc 


3900


cttcctgctc 


tttcttctct 


gcccaggccg ctgcagctgc 


acagcctctg 


ctacacctgg


3960


gctgcctggg aggcttcctg gtgtggtgtc tggaccccac 


ggccttgggt 


catcctgtgg 


4020


ctggtctggg gtggggtctg 


ttgtggtcct tccacggtgt 


cagtggcctg 


aagtccctcg 


4080 


cttttggggg gggggtctct 


cacccccagg ccacataggg ccagtggtag gggttccctc 


4140 


tatgtcgggc


agtgctgagg 


gctgggatgc tctgtgaccc 


cagctggagc 


ccacacctaa 


4200 


gggctggcat 


ccacatcatt 


tcaccctgca gtgagggaag 


aggccaccag 


gtggcagcac 


4260 


agccacaccc gttcccacgt 


cagaggaggg caaggctggg 


tactcagcag 


ccactctgag 


4320 



ccggggctcc 


ttccaggagc 


tgaaatccac ctgtctccat cttccttgcc 


tgcctgggta 


4380 


ctcatgccaa 


gcagagactg ggattagggg ttctgtgctc ttgcctaatt 


aggaacattc 


4440 


tcccatgtct 


cttgtgtggt 


cccagaagga gaagtgagtt tgccaaggat 


atggggcagg 


4500 


aggctccctc 


tgctgacccc 


ctgcagcctg gagccagccc ggggactgtc 


ctgggtggag 


4560 


ggcaggtgaa 


cacaagctgc 


tgccggggac tgtcctgggt ggacggcagg 


tgaacacaag 


4620 


cggctgccgc 


atgtagccac 


tcactcgact ttttttcagc tgtgaccatt 


cctgggagct 


4680 


ctttgagcct 


ttctgtctca 


tttggaacca gggggaacca ggaaggggct 


cctggcctct 


4740 


ctgtgtcctc 


tgcagtgggg 


gttgtggggg gcgcagcatc cacgccttgc 


tgcccttctt 


4800 


tcatgaagtc 


tgttttttaa 


gtgctggttc ccccgaatat tttatgcaga 


ggagggaaaa 


4860 


tttatagtgg 


caattatttt 


ctcacagtct ggtgagcagg caattaatta ggagtaaggg 


4920 


ggcctagtag 


agcgtggcgt 


gtggcagaat cgcaccgccc cggctcccca 


gcccaccgcc 


4980 


atgcagggct 


cgcgtgcggg aaaactaata tgccggcgtt taagcctgtg 


cccctctgct 


5040 


gggtgtaact 


gcgctgaaat 


aaatgatctg acaatgtgaa aaaaaaaaaa 






<210> 6 

<211> 5621 

<212> DNA 

<213> Homo sapien 








<400> 6 
gcggcggtgg 


cgggaaggag 


catcaaagat gaagaagaag agacagtctt 


tcgagaggtg 


60 


gtcagttttt 


ccccggaccc 


cctgccagtt agatattatg acaaggacac 


caccaaacca 


120 


atcagctttt 


acttgtcttc gctggaggag ctcttggcgt ggaagccccg cttggaggat 


180 


ggctttaatg 


tggccctgga 


gcccctggcg tgtcgccagc cccctctgag 


cagccagagg 


240 


ccccggactt 


tgttgtgtca 


tgacatgatg ggcgggtacc tggatgacag 


ggactttcat 


300 


cacggagtgg 


aatgaagggg 


gaaggctctg tgaagccttc ctggccgggg 


atgagcgctc 


360 


gtaccaggca 


gtggctgacc 


ggctggtcca gatcactcag ttttttcgtt 


ttgatggctg 


420 


gctgatcaac 


atcgagaact 


cgctgagtct ggccgctgtg gggaacatgc 


ctcctttcct 


480 


gcggtacctc 


accacacagc 


tgcaccggca ggtcccaggg ggcctggtgc 


tctggtatga 


540 


cagcgtggtg 


caaagtgggc 


agctcaaatg gcaagacgaa ctcaaccagc acaacagggt 


600 


cttctttgat 


tcctgcgacg gcttcttcac taactataac tggcgggagg 


agcacttgga 


660 


gcggatgctg 


gggcaggctg gggagcgccg ggctgatgtg tacgtgggcg 


tggatgtgtt 


720


tgctcgaggg 


aacgtggtcg 


gaggccgatt cgacacagac aaggtgggtg gtggctttcg 


780 


tccaagggcc 


agcggcccag 


tgcctcccct gggacctcat ttcctcatgg 


accttccatt 


840 
cccgtctgca


cctcaaagga 


atgacagcag 


ctgttcttcc 


cagagtgggg 


atcccgtggc 


900 


actgagaaac 


aggtgtcctg 


ccccggccaa 


gctatgtccc 


cactgaaacc 


cctttcccgg 


960 


ctatttctac 


aatcagataa 


tgtatttttg 


tgtgggtgtt 


accagacaga 


ggcgcgtcgt 


1020 


tcccccgctg 


tccggtcctc 


ggcgggcctg 


ggcctcctgt 


cccagctgcg 


ttccttgatc 


1080 


ttccccgtga 


gggaagcctg 


cggggacgca 


caggggcctg 


cagcgcctct 


cactgcggct 


1140 


caggggcgag 


gacgggggag 


gtgcccaggg 


tttggggatc 


ggtaggtgtc 


atgacgaggg 


1200 


agcattttct 


ggcaaggcag 


gtcacaccag 


cacagaggaa 


ggagcggggc 


caggccggtg 


1260 


cccctggcgt 


gggatgcgca 


gggctctgcc 


tgcctctggt 


ctatcagtcc 


tgcccgttcc 


1320 


cagagcccca 


gggatgttct 


cagcccttaa 


tttagcttta 


aatgagtaaa 


ttaatccaat 


1380 


aactaaagag 


cctctttact 


gcccgtgatt 


ctgataatcg 


aatgagaggt 


gtgccagagg 


1440 


acaatgggac 


cgcgcagcca 


cagccagttg 


gggcccagcc 


cctgcccccc 


tgcccttgca 


1500 


gcagcgctga 


ccgcctctcc 


cgcgccctct 


tctggcgggc 


ttgtggaacg 


acgggggtga 


1560 


tgcccagcag 


cctccagtcg 


ttggagctga 


tccgaaagca 


tggcttctcc 


gtggctttgt 


1620 


ttgcccccgg 


ctgggtgtat 


gagtgtctgg 


agaagaagga 


tttcttccag 


aaccaggaca 


1680 


agttctgggg 


ccgactggag 


cgttatctgc 


ccacacatag 


catctgctcc 


ttgcctttcg 


1740 


tcacgtcctt 


ctgcctgggc 


atgggtgcac 


ggagggtctg 


ctatggccag 


gaagaggcgg 


1800 


tagggccctg 


gtaccacctg 


agcgcccagg 


agatccagcc 


cttgtttgga 


gaacacaggc 


1860 


tgggagggga 


tggccggggc 


tgggtgagga 


cgcactgctg 


cctggaggat 


gcctggcacg 


1920 


gaggcagctc 


cctgctcgtc 


cggggtgtga 


tcccaccgga 


ggttggaaat 


gtggctgtga 


1980 


ggtgggtgag 


tgacggagga 


cggtgggccc 


accagcttct 


cccatcacac 


gtggtggcca 


2040 


tggaactgga 


cagatggggc 


agtggaggcc 


agaacaagga 


cagaggacag 


acccagatgg 


2100 


gttttttaaa 


attgtgatca 


aaaatgcatt 


aacgtgaaat 


tcaccatctt 


gaccctgttg 


2160 


aagtgaacgg 


tacaggagtg 


ttaactctgc 


gtacgttgct 


acgtgacaga 


tccctagaac 


2220 


tgtgtcatct 


tgtaggactg 


gactctcccc 


attgcacatt 


agccccccat 


cctcccccag 


2280 


ccccccaaag 


ccaccccttt 


actctgcatc 


tgtgagtttg 


gatactttag 


atgccccata 


2340 


tgagggaatc 


ccatacggta 


tttcagaact 


gagatgtttc 


agcccacaaa 


agaatctggc 


2400 


cttgggggag 


gacttgtccc 


ctggctacgg 


acaggaccac 


gatgtgggag 


catttggcac 


2460 


ccaggccccc 


tgttcctgga 


gggaggggct 


ggtggacgcg 


atttggagct 


ggcttcgatt 


2520 


tctcagtggt 


ctctccacgg 


cacccatcac 


aggacctttt 


tccccaggtt 


atttttattc 


2580 


cctgcaggcc 


ccagtgccac 


ccaagattta 


cctgtccatg 


gtgtataagc 


ttgaggggcc 


2640 
cacggacgtc acagttgctt tggagctgac cacaggggat gccggcagct gccacatcgg 2700

tggcatctca gtgttgaacg cagaaacaag ctcaagacac agcctccgac ccctccgggt 2760

gccccccacc aagctggcca gatgggtggg ccgctgcggc cggcagctga gtgggggctg 2820

ggtccagcac tgctacgagg tgagcctgcg tgggtgcctg ctgctagacc tcctcgtttg 2880

cttctcacgg ccgccgggta gtcgggagga ggagagcttc acctgtcggc ttggagagat 2940

ccaggtggtg gacgctgcca gcctgctggc ccctctgccc caggtgcagg ccgtcaccat 3000

ctctcacatc cgctggcagc catccgcctc tgagcgggag gggccccctg ctctgctcca 3060

gctcagctgc accctgcact ggtccttcct cctctcacaa gtccgttgct tccgaatcca 3120

ctgctgggga gggatgagtg atgactctcc gggcagggag ctgccgaggc cagagatgcc 3180

catgttcctg gggttggctt ttgccaccca gtaccggata gtggacctgc tggtggaagc 3240

cgccgggccc ggccaggatc gtcgcatgga atttctggtg gagcctgtcc ccaaggaagg 3300

gttccgggta cctcaggccg agtggggcag ggcagttctg ctttattcag cccctgcatg 3360

agcggatgct aaggccgggt ggtctcctgg cctcgggctg aggcctcttc ccggctgtct 3420

gcccctggcc tgcgctggac ctgctaagtg cccacagtgg cagcgaggtc ccggtcccgg 3480

ggctggggtg ggagaccccg ggctgagtgc tgtggctttc tggtgggggg cgatggaaac 3540

aggaaaccaa gcagtgggat cgcagcgttg gtcactgcga ggcgagtggc gggctttctg 3600

tttctgcctt gtccctcccc acggtacctg gttcccaggt gaaaatgaaa ggaggggaga 3660

agttgagaac agaacattcc ataaaggata tttcctaata ggctgcaaga tgctgatgcc 3720

gagaatgatg attttctttc ctgcagatga aactattaga aagggtctta gattgtggca 3780

ggtaggcttt ggagcaggcg ccgagacatt tctgagcatg aggacgagct acagcagctc 3840

ctggggtggg gctgcctgcg ggatggcggg agaggatgcc ctggagaacc gtcctcccag 3900

tgtggaaggc ccttttccct gaggagtggg cattctgggc cagccggcgc tggcttcgtg 3960

cctccacgtg ggccagcccc agctgctccg tgtttcctgg cgttggcaat ttactgtgct 4020

gctgagtgtg aggtcatctc cggagcgttt tcagcagccc ctggctctgc ggcgtctctt 4080

ccgggctgtg ggcatgcagg gaagtggctc tgaggcagtc tgcgctgtgg ccctgcctct 4140

gcccagcgag aggccgtggg ctctggacaa gccgcccttc aggctggggt agcaggtcag 4200

tccaggcagg aagcagcacc tgccccccgc gccagcccag ccccagcctg agtgcaggag 4260

ctgcaggacc cgcgggggct tttccagcta ctctgttcct tcacgtcctc ccttctcagc 4320

ctcgtccaag caccgggaag acctccaggc tgaccccttg agcagcagtc agcacaggtg 4380

cgtgggggcg tgagggaggc agggtcttca ccacaggcgc cttcctctgt ccttcctgct 4440

ctttcttctc tgcccaggcc gctgcagctg cacagcctct gctacacctg ggctgcctgg 4500

gaggcttcct ggtgtggtgt ctggacccca cggccttggg tcatcctgtg gctggtctgg 4560

ggtggggtct gttgtggtcc ttccacggtg tcagtggcct gaagtccctc gcttttgggg 4620 

ggggggtctc tcacccccag gccacatagg gccagtggta ggggttccct ctatgtcggg 4680 

cagtgctgag ggctgggatg ctctgtgacc ccagctggag cccacaccta agggctggca 4740 

tccacatcat ttcaccctgc agtgagggaa gaggccacca ggtggcagca cagccacacc 4800 

cgttcccacg tcagaggagg gcaaggctgg gtactcagca gccactctga gccggggctc 4860

cttccaggag ctgaaatcca cctgtctcca tcttccttgc ctgcctgggt actcatgcca 4920 

agcagagact gggattaggg gttctgtgct cttgcctaat taggaacatt ctcccatgtc 4980 

tcttgtgtgg tcccagaagg agaagtgagt ttgccaagga tatggggcag gaggctccct 5040

ctgctgaccc cctgcagcct ggagccagcc cggggactgt cctgggtgga gggcaggtga 5100 

acacaagctg ctgccgggga ctgtcctggg tggacggcag gtgaacacaa gcggctgccg 5160 

catgtagcca ctcactcgac tttttttcag ctgtgaccat tcctgggagc tctttgagcc 5220 

tttctgtctc atttggaacc agggggaacc aggaaggggc tcctggcctc tctgtgtcct 5280 

ctgcagtggg ggttgtgggg ggcgcagcat ccacgccttg ctgcccttct ttcatgaagt 5340 

ctgtttttta agtgctggtt cccccgaata ttttatgcag aggagggaaa atttatagtg 5400

gcaattattt tctcacagtc tggtgagcag gcaattaatt aggagtaagg gggcctagta 5460 

gagcgtggcg tgtggcagaa tcgcaccgcc ccggctcccc agcccaccgc catgcagggc 5520 

tcgcgtgcgg gaaaactaat atgccggcgt ttaagcctgt gcccctctgc tgggtgtaac 5580 

tgcgctgaaa taaatgatct gacaatgtga aaaaaaaaaa a 5621 

<210> 7 

<211> 5519 

<212> DNA 

<213> Homo sapien 

<400> 7 

acagtgtcat ggaggccgcg gcggtgacgg tcacccggtc ggctacacgg cggcggcggc 60

ggcagctgca ggggctggcg gccccggagg cggggacgca ggaggagcag gaggatcagg 120 

agccgcggcc gcggcggcgg cggccgggaa ggagcatcaa agatgaagaa gaagagacag 180 

tctttcgaga ggtggtcagt ttttccccgg accccctgcc agttagatat tatgacaagg 240 

acaccaccaa accaatcagc ttttacttgt cttcgctgga ggagctcttg gcgtggaagc 300 

cccgcttgga ggatggcttt aatgtggccc tggagcccct ggcgtgtcgc cagccccctc 360 

tgagcagcca gaggccccgg actttgttgt gtcatgacat gatgggcggg tacctggatg 420 

acaggttcat tcagggctcg gtggtgcaga ctccctatgc tttctaccac tggcagtgca 480 
tcgacgtctt tgtgtacttc agccaccaca ccgtcaccat tcccccagtg ggctggacca 540

acactgccca caggcatggg gtctgcgtgc tggggacttt catcacggag tggaatgaag 600 

ggggaaggct ctgtgaagcc ttcctggccg gggatgagcg ctcgtaccag gcagtggctg 660 

accggctggt ccagatcact cagttttttc gttttgatgg ctggctgatc aacatcgaga 720 

actcgctgag tctggccgct gtggggaaca tgcctccttt cctgcggtac ctcaccacac 780 

agctgcaccg gcaggtccca gggggcctgg tgctctggta tgacagcgtg gtgcaaagtg 840

ggcagctcaa atggcaagac gaactcaacc agcacaacag ggtcttcttt gattcctgcg 900 

acggcttctt cactaactat aactggcggg aggagcactt ggagcggatg ctggggcagg 960 

ctggggagcg ccgggctgat gtgtacgtgg gcgtggatgt gtttgctcga gggaacgtgg 1020 

tcggaggccg attcgacaca gacaaggtgg gtggtggctt tcgtccaagg gccagcggcc 1080 

cagtgcctcc cctgggacct catttcctca tggaccttcc attcccgtct gcacctcaaa 1140 

ggaatgacag cagctgttct tcccagagtg gggatcccgt ggcactgaga aacaggtgtc 1200 

ctgccccggc caagctatgt ccccactgaa acccctttcc cggctatttc tacaatcaga 1260 

taatgtattt ttgtgtgggt gttaccagac agaggcgcgt cgttcccccg ctgtccggtc 1320

ctcggcgggc ctgggcctcc tgtcccagct gcgttccttg atcttccccg tgagggaagc 1380

ctgcggggac gcacaggggc ctgcagcgcc tctcactgcg gctcaggggc gaggacgggg 1440

gaggtgccca gggtttgggg atcggtaggt gtcatgacga gggagcattt tctggcaagg 1500 

caggtcacac cagcacagag gaaggagcgg ggccaggccg gtgcccctgg cgtgggatgc 1560 

gcagggctct gcctgcctct ggtctatcag tcctgcccgt tcccagagcc ccagggatgt 1620 

tctcagccct taatttagct ttaaatgagt aaattaatcc aataactaaa gagcctcttt 1680

actgcccgtg attctgataa tcgaatgaga ggtgtgccag aggacaatgg gaccgcgcag 1740

ccacagccag ttggggccca gcccctgccc ccctgccctt gcagcagcgc tgaccgcctc 1800 

tcccgcgccc tcttctggcg ggcttgtgga acgacggggg tgatgcccag cagcctccag 1860 

tcgttggagc tgatccgaaa gcatggcttc tccgtggctt tgtttgcccc cggctgggtg 1920

tatgagtgtc tggagaagaa ggatttcttc cagaaccagg acaagttctg gggccgactg 1980

gagcgttatc tgcccacaca tagcatctgc tccttgcctt tcgtcacgtc cttctgcctg 2040

ggcatgggtg cacggagggt ctgctatggc caggaagagg cggtagggcc ctggtaccac 2100 

ctgagcgccc aggagatcca gcccttgttt ggagaacaca ggctgggagg ggatggccgg 2160 

ggctgggtga ggacgcactg ctgcctggag gatgcctggc acggaggcag ctccctgctc 2220 

gtccggggtg tgatcccacc ggaggttgga aatgtggctg tgaggtgggt gagtgacgga 2280 
ggacggtggg


cccaccagct 


tctcccatca 


cacgtggtgg 


ccatggaact 


ggacagatgg 


2340 


ggcagtggag 


gccagaacaa 


ggacagagga 


cagacccaga 


tgggcttctt 


aaaattgtga 


2400 


tcaaaaatgc 


attaacgtga aattcaccat 


cttgaccctg 


ttgaagtgaa 


cggtacagga 


2460 


gtgttaactc


tgcgtacgtt 


gctacgtgac 


agatccctag 


aactgtgtca 


tcttgtagga 


2520 


ctggactctc


cccattgcac 


attagccccc 


catcctcccc 


cagcccccca 


aagccacccc 


2580 


tttaccctgc 


atctgtgagt 


ttggatactt 


tagatgcccc 


atatgaggga 


atcccatacg 


2640 


gtattncaga 


actgagatgt 


ttcagcccac aaaagaatct 


ggccttgggg 


gaggacttgt 


2700 


cccctggcta


cggacaggac 


cacgatgtgg gagcatttgg 


cacccaggcc 


ccctgttcct 


2760 


ggagggaggg


gctggtggac 


gcgatttgga 


gctggcttcg 


atttctcagt 


ggtctctcca 


2820 


cggcacccat 


cacaggacct 


ttttccccag gttattttta 


ttccctgcag 


gccccagtgc 


2880 


cacccaagat 


ttacctgtcc 


atggtgtata 


agcttgaggg 


gcccacggac 


gtcacagttg 


2940 


ctttggagct


gaccacaggg gatgccggca 


gctgccacat 


cggtggcatc 


tcagtgttga 


3000 


acgcagaaac 


aagctcaaga 


cacagcctcc 


gacccctccg 


ggtgcccccc 


accaagctgg 


3060 


ccagatgggt 


gggccgctgc 


ggccggcagc 


tgagtggggg 


ctgggtccag 


cactgctacg 


3120 


aggtgagcct 


gcgtgggtgc 


ctgctgctag 


acctcctcgt 


ttgcttctca 


cggccgccgg 


3180 


gtagtcggga 


ggaggagagc 


ttcacctgtc 


ggcttggaga 


gatccaggtg 


gtggacgctg 


3240 


ccagcctgct 


ggcccctctg 


ccccaggtgc 


aggccgtcac 


catctctcac 


atccgctggc 


3300 


agccatccgc 


ctctgagcgg 


gaggggcccc 


ctgctctgct 


ccagctcagc 


tgcaccctgc 


3360 


actggtcctt 


cctcctctca 


caagtccgtt 


gcttccgaat 


ccactgctgg ggagggatga 


3420 


gtgatgactc 


tccgggcagg gagctgccga 


ggccagagat 


gcccatgttc 


ctggggttgg 


3480 


cttttgccac 


ccagtaccgg 


atagtggacc 


tgctggtgga 


agccgccggg 


cccggccagg 


3540 


atcgtcgcat


ggaatttctg 


gtggagcctg 


tccccaagga 


agggttccgg 


gtacctcagg 


3600 


ccgagtgggg 


cagggcagtt 


ctgctttatt 


cagcccctgc 


atgagcggat 


gctaaggccg 


3660 


ggtggtctcc 


tggcctcggg 


ctgaggcctc 


ttcccggctg 


tctgcccctg 


gcctgcgctg 


3720 


gacctgctaa 


gtgcccacag 


tggcagcgag gtcccggtcc 


cggggctggg gtgggagacc 


3780 


ccgggctgag 


tgctgtggct 


ttctggtggg 


gggcgatgga 


aacaggaaac 


caagcagtgg 


3840 


gatcgcagcg 


ttggtcactg 


cgaggcgagt 


ggcgggcttt 


ctgtttctgc 


cttgtccctc 


3900 


cccacggtac 


ctggttccca 


ggtgaaaatg 


aaaggagggg 


agaagttgag 


aacagaacat 


3960 


tccataaagg 


atatttccta 


ataggctgca 


agatgctgat 


gccgagaatg atgattttct 


4020 


ttcctgcaga 


tgaaactatt 


agaaagggtc 


ttagattgtg 


gcaggtaggc 


tttggagcag 


4080 


gcgccgagac 


atttctgagc 


atgaggacga gctacagcag 


ctcctggggt 


ggggctgcct 


4140 
gcgggatggc gggagaggat gccctggaga accgtcctcc cagtgtggaa ggcccttttc 4200

cctgaggagt gggcattctg ggccagccgg cgctggcttc gtgcctccac gtgggccagc 4260 

cccagctgct ccgtgtttcc tggcgttggc aatttactgt gctgctgagt gtgaggtcat 4320 

ctccggagcg ttttcagcag cccctggctc tgcggcgtct cttccgggct gtgggcatgc 4380 

agggaagtgg ctctgaggca gtctgcgctg tggccctgcc tctgcccagc gagaggccgt 4440 

gggctctgga caagccgccc ttcaggctgg ggtagcaggt cagtccaggc aggaagcagc 4500 

acctgccccc cgcgccagcc cagccccagc ctgagtgcag gagctgcagg acccgcgggg 4560 

gcttttccag ctactctgtt ccttcacgtc ctcccttctc agcctcgtcc aagcaccggg 4620 

aagacctcca ggctgacccc ttgagcagca gtcagcacag tgagggaaga ggccaccagg 4680 

tggcagcaca gccacacccg ttcccacgtc agaggagggc aaggctgggt actcagcagc 4740 

cactctgagc cggggctcct tccaggagct gaaatccacc tgtctccatc ttccttgcct 4800 

gcctgggtac tcatgccaag cagagactgg gattaggggt tctgtgctct tgcctaatta 4860 

ggaacattct cccatgtctc ttgtgtggtc ccagaaggag aagtgagttt gccaaggata 4920 

tggggcagga ggctccctct gctgaccccc tgcagcctgg agccagcccg gggactgtcc 4980 

tgggtggagg gcaggtgaac acaagctgct gccggggact gtcctgggtg gacggcaggt 5040

gaacacaagc ggctgccgca tgtagccact cactcgactt tttttcagct gtgaccattc 5100 

ctgggagctc tttgagcctt tctgtctcat ttggaaccag ggggaaccag gaaggggctc 5160 

ctggcctctc tgtgtcctct gcagtggggg ttgtgggggg cgcagcatcc acgccttgct 5220 

gcccttcttt catgaagtct gttttttaag tgctggttcc cccgaatatt ttatgcagag 5280 

gagggaaaat ttatagtggc aattattttc tcacagtctg gtgagcaggc aattaattag 5340

gagtaagggg gcctagtaga gcgtggcgtg tggcagaatc gcaccgcccc ggctccccag 5400 

cccaccgcca tgcagggctc gcgtgcggga aaactaatat gccggcgttt aagcctgtgc 5460 

ccctctgctg ggtgtaactg cgctgaaata aatgatctga caatgtgaaa aaaaaaaaa 5519 

<210> 8 

<211> 6102 

<212> DNA 

<213> Homo sapien 

<400> 8 

acagtgtcat ggaggccgcg gcggtgacgg tcacccggtc ggctacacgg cggcggcggc 60 

ggcagctgca ggggctggcg gccccggagg cggggacgca ggaggagcag gaggatcagg 120 

agccgcggcc gcggcggcgg cggccgggaa ggagcatcaa agatgaagaa gaagagacag 180 

tctttcgaga ggtggtcagt ttttccccgg accccctgcc agttagatat tatgacaagg 240 
acaccaccaa


accaatcagc 


ttttacttgt cttcgctgga 


ggagctcttg 


gcgtggaagc 


300 


cccgcttgga 


ggatggcttt


aatgtggccc tggagcccct 


ggcgtgtcgc 


cagccccctc 


360 


tgagcagcca gaggccccgg 


actttgttgt gtcatgacat 


gatgggcggg 


tacctggatg 


420 


acaggttcat 


tcagggctcg gtggtgcaga ctccctatgc 


tttctaccac 


tggcagtgca 


480 


tcgacgtctt 


tgtgtacttc 


agccaccaca ccgtcaccat 


tcccccagtg ggctggacca 


540 


acactgccca 


caggcatggg 


gtctgcgtgc tggggacttt 


catcacggag 


tggaatgaag 


600 


ggggaaggct 


ctgtgaagcc 


ttcctggccg gggatgagcg 


ctcgtaccag gcagtggctg 


660 


accggctggt 


ccagatcact 


cagttttttc gttttgatgg 


ctggctgatc 


aacatcgaga 


720 


actcgctgag 


tctggccgct 


gtggggaaca tgcctccttt 


cctgcggtac 


ctcaccacac 


780 


agctgcaccg 


gcaggtccca 


gggggcctgg tgctctggta


tgacagcgtg gtgcaaagtg 


840 


ggcagctcaa atggcaagac gaactcaacc agcacaacag ggtcttcttt 


gattcctgcg 


900 


acggcttctt 


cactaactat 


aactggcggg aggagcactt 


ggagcggatg 


ctggggcagg 


960 


ctggggagcg 


ccgggctgat 


gtgtacgtgg gcgtggatgt 


gtttgctcga gggaacgtgg 


1020 


tcggaggccg 


attcgacaca 


gacaaggtgg gtggtggctt 


tcgtccaagg gccagcggcc 


1080 


cagtgcctcc 


cctgggacct 


catttcctca tggaccttcc 


attcccgtct 


gcacctcaaa 


1140 


ggaatgacag 


cagctgttct 


tcccagagtg gggatcccgt 


ggcactgaga 


aacaggtgtc 


1200 


ctgccccggc 


caagctatgt 


ccccactgaa acccctttcc 


cggctatttc 


tacaatcaga 


1260 


taatgtattt 


ttgtgtgggt 


gttaccagac agaggcgcgt 


cgttcccccg 


ctgtccggtc 


1320 


ctcggcgggc 


ctgggcctcc 


tgtcccagct gcgttccttg 


atcttccccg 


tgagggaagc 


1380 


ctgcggggac 


gcacaggggc 


ctgcagcgcc tctcactgcg gctcaggggc 


gaggacgggg 


1440 


gaggtgccca gggtttgggg atcggtaggt gtcatgacga gggagcattt 


tctggcaagg 


1500 


caggtcacac 


cagcacagag gaaggagcgg ggccaggccg gtgcccctgg 


cgtgggatgc 


1560 


gcagggctct 


gcctgcctct 


ggtctatcag tcctgcccgt 


tcccagagcc 


ccagggatgt 


1620 


tctcagccct 


taatttagct 


ttaaatgagt aaattaatcc 


aataactaaa 


gagcctcttt 


1680 


actgcccgtg 


attctgataa 


tcgaatgaga ggtgtgccag 


aggacaatgg 


gaccgcgcag 


1740 


ccacagccag 


ttggggccca 


gcccctgccc ccctgccctt 


gcagcagcgc 


tgaccgcctc 


1800 


tcccgcgccc 


tcttctggcg ggcttgtgga acgacggggg tgatgcccag 


cagcctccag 


1860 


tcgttggagc 


tgatccgaaa 


gcatggcttc tccgtggctt 


tgtttgcccc 


cggctgggtg 


1920 


tatgagtgtc 


tggagaagaa 


ggatttcttc cagaaccagg acaagttctg gggccgactg 


1980 


gagcgttatc 


tgcccacaca 


tagcatctgc tccttgcctt 


tcgtcacgtc 


cttctgcctg 


2040 
ggcatgggtg


cacggagggt


ctgctatggc


caggaagagg 


cggtagggcc


ctggtaccac 


2100 


ctgagcgccc


aggagatcca


gcccttgttt


ggagaacaca 


ggctgggagg 


ggatggccgg 


2160 




ggctgggtga


ggacgcactg


ctgcctggag


gatgcctggc


acggaggcag


ctccctgctc 


2220 


gtccggggtg


tgatcccacc


ggaggttgga


aatgtggctg 


tgaggtgggt 


gagtgacgga 


2280 


ggacggtggg


cccaccagct


tctcccatca


cacgtggtgg 


ccatggaact 


ggacagatgg 


2340 


ggcagtggag


gccagaacaa


ggacagagga 


cagacccaga 


tgggtttttt 


aaaattgtga 


2400 




tcaaaaatgc


attaacgtga


aattcaccat


cttgaccctg


ttgaagtgaa


cggtacagga


2460 




gtgttaactc


tgcgtacgtt


gctacgtgac


agatccctag 


aactgtgtca 


tcttgtagga 


2520 


ctggactctc


cccattgcac


attagccccc


catcctcccc


cagcccccca 


aagccacccc 


2580 




tttactctgc


atctgtgagt


ttggatactt


tagatgcccc 


atatgaggga 


atcccatacg 


2640 


gtatttcaga


actgagatgt


ttcagcccac


aaaagaatct


ggccttgggg 


gaggacttgt 


2700 


cccctggcta


cggacaggac


cacgatgtgg 


gagcatttgg 


cacccaggcc 


ccctgttcct 


2760 


ggagggaggg


gctggtggac


gcgatttgga 


gctggcttcg 


atttctcagt 


ggtctctcca 


2820 


cggcacccat


cacaggacct


ttttccccag


gttattttta 


ttccctgcag 


gccccagtgc 


2880 


cacccaagat


ttacctgtcc


atggtgtata 


agcttgaggg 


gcccacggac 


gtcacagttg 


2940 


ctttggagct


gaccacaggg


gatgccggca


gctgccacat 


cggtggcatc 


tcagtgttga 


3000 


acgcagaaac


aagctcaaga


cacagcctcc 


gacccctccg 


ggtgcccccc 


accaagctgg 


3060 


ccagatgggt


gggccgctgc


ggccggcagc 


tgagtggggg 


ctgggtccag 


cactgctacg 


3120 


aggtgagcct


gcgtgggtgc


ctgctgctag


acctcctcgt


ttgcttctca


cggccgccgg 


3180 


gtagtcggga


ggaggagagc


ttcacctgtc 


ggcttggaga 


gatccaggtg 


atgcttccca 


3240 


pa pep ^«rp p 4~ p»"» 

y ay g y g C CCy 


ggccgggcxg 


gccgtctgtc 


cagctggagt 


gggggtggag


gccgccccag 


3300 


gcgyCCCCCt 


tctngggttc 


agcggggaac


tggggtggag 


gagtcaggga 


ggggaaatgt 


3360 


gcgcgcgggg 


P/P a p p p Kot> 

ycaccctcuc 


cccgcccccg 


ggcgtccagc 


cgtgctgagc 


cttctctcct 


3420 


yuuayy Lyy l» 


ggacgctgcc


agcctgctgg


cccctctgcc 


ccaggtgcag 


gccgtcacca 


3480 


tctctcacat


ccgctggcag


ccatccgcct


ctgagcggga 


ggggccccct 


gctctgctcc 


3540 


agctcagctg


caccctgcac


tggtccttcc


tcctctcaca 


agtccgttgc 


ttccgaatcc 


3600 


actgctgggg 


agggatgagt 


gatgactctc 


cgggcaggga


gctgccgagg


ccagagatgc


3660 


ccatgttcct 


ggggttggct 


tttgccaccc 


agtaccggat


agtggacctg 


ctggtggaag 


3720 


ccgccgggcc 


cggccaggat


cgtcgcatgg 


aatttctggt 


ggagcctgtc 


cccaaggaag 


3780 


ggttccgggt 


acctcaggcc 


gagtggggca 


gggcagttct 


gctttattca 


gcccctgcat 


3840 


gagcggatgc


taaggccggg 


tggtctcctg 


gcctcgggct 


gaggcctctt


cccggctgtc 


3900 
tgcccctggc ctgcgctgga cctgctaagt gcccacagtg gcagcgaggt cccggtcccg 3960

gggctggggt gggagacccc gggctgagtg ctgtggcttt ctggtggggg gcgatggaaa 4020 

caggaaacca agcagtggga tcgcagcgtt ggtcactgcg aggcgagtgg cgggctttct 4080 

gtttctgcct tgtccctccc cacggtacct ggttcccagg tgaaaatgaa aggaggggag 4140

aagttgagaa cagaacattc cataaaggat atttcctaat aggctgcaag atgctgatgc 4200 

cgagaatgat gattttcttt cctgcagatg aaactattag aaagggtctt agattgtggc 4260 

aggtaggctt tggagcaggc gccgagacat ttctgagcat gaggacgagc tacagcagct 4320 

cctggggtgg ggctgcctgc gggatggcgg gagaggatgc cctggagaac cgtcctccca 4380 

gtgtggaagg cccttttccc tgaggagtgg gcattctggg ccagccggcg ctggcttcgt 4440 

gcctccacgt gggccagccc cagctgctcc gtgtttcctg gcgttggcaa tttactgtgc 4500

tgctgagtgt gaggtcatct ccggagcgtt ttcagcagcc cctggctctg cggcgtctct 4560 

tccgggctgt gggcatgcag ggaagtggct ctgaggcagt ctgcgctgtg gccctgcctc 4620 

tgcccagcga gaggccgtgg gctctggaca agccgccctt caggctgggg tagcaggtca 4680 

gtccaggcag gaagcagcac ctgccccccg cgccagccca gccccagcct gagtgcagga 4740 

gctgcaggac ccgcgggggc ttttccagct actctgttcc ttcacgtcct cccttctcag 4800 
cctcgtccaa gcaccgggaa gacctccagg ctgacccctt gagcagcagt cagcacaggt 4860
gcgtgggggc gtgagggagg cagggtcttc accacaggcg ccttcctctg tccttcctgc 4920 
tctttcttct ctgcccaggc cgctgcagct gcacagcctc tgctacacct gggctgcctg 4980 
ggaggcttcc tggtgtggtg tctggacccc acggccttgg gtcatcctgt ggctggtctg 5040 
gggtggggtc tgttgtggtc cttccacggt gtcagtggcc tgaagtccct cgcttttggg 5100 
gggggggtct ctcaccccca ggccacatag ggccagtggt aggggttccc tctatgtcgg 5160 
gcagtgctga gggctgggat gctctgtgac cccagctgga gcccacacct aagggctggc 5220 
atccacatca tttcaccctg cagtgaggga agaggccacc aggtggcagc acagccacac 5280 
ccgttcccac gtcagaggag ggcaaggctg ggtactcagc agccactctg agccggggct 5340 
ccttccagga gctgaaatcc acctgtctcc atcttccttg cctgcctggg tactcatgcc 5400 
aagcagagac tgggattagg ggttctgtgc tcttgcctaa ttaggaacat tctcccatgt 5460 
ctcttgtgtg gtcccagaag gagaagtgag tttgccaagg atatggggca ggaggctccc 5520 
tctgctgacc ccctgcagcc tggagccagc ccggggactg tcctgggtgg agggcaggtg 5580 
aacacaagct gctgccgggg actgtcctgg gtggacggca ggtgaacaca agcggctgcc 5640 
gcatgtagcc actcactcga ctttttttca gctgtgacca ttcctgggag ctctttgagc 5700 
ctttctgtct catttggaac cagggggaac caggaagggg ctcctggcct ctctgtgtcc 5760
tctgcagtgg gggttgtggg gggcgcagca tccacgcctt gctgcccttc tttcatgaag 5820
tctgtttttt aagtgctggt tcccccgaat attttatgca gaggagggaa aatttatagt 5880
ggcaattatt ttctcacagt ctggtgagca ggcaattaat taggagtaag ggggcctagt 5940
agagcgtggc gtgtggcaga atcgcaccgc cccggctccc cagcccaccg ccatgcaggg 6000
ctcgcgtgcg ggaaaactaa tatgccggcg tttaagcctg tgcccctctg ctgggtgtaa 6060
ctgcgctgaa ataaatgatc tgacaatgtg aaaaaaaaaa aa 6102



<210> 9 

<211> 3177 

<212> DNA 

<213> Homo sapien 

<400> 9 

acagtgtcat ggaggccgcg gcggtgacgg tcacccggtc ggctacacgg cggcggcggc 60
ggcagctgca ggggctggcg gccccggagg cggggacgca ggaggagcag gaggatcagg 120
agccgcggcc gcggcggcgg cggccgggaa ggagcatcaa agatgaagaa gaagagacag 180
tctttcgaga ggtggtcagt ttttccccgg accccctgcc agttagatat tatgacaagg 240
acaccaccaa accaatcagc ttttacttgt cttcgctgga ggagctcttg gcgtggaagc 300
cccgcttgga ggatggcttt aatgtggccc tggagcccct ggcgtgtcgc cagccccctc 360
tgagcagcca gaggccccgg actttgttgt gtcatgacat gatgggcggg tacctggatg 420
acaggttcat tcagggctcg gtggtgcaga ctccctatgc tttctaccac tggcagtgca 480
tcgacgtctt tgtgtacttc agccaccaca ccgtcaccat tcccccagtg ggctggacca 540
acactgccca caggcatggg gtctgcgtgc tggggacttt catcacggag tggaatgaag 600
ggggaaggct ctgtgaagcc ttcctggccg gggatgagcg ctcgtaccag gcagtggctg 660
accggctggt ccagatcact cagttttttc gttttgatgg ctggctgatc aacatcgaga 720
actcgctgag tctggccgct gtggggaaca tgcctccttt cctgcggtac ctcaccacac 780
agctgctggt ggaagccgcc gggcccggcc aggatcgtcg catggaattt ctggtggagc 840
ctgtccccaa ggaagggttc cgggtacctc aggccgagtg gggcagggca gttctgcttt 900
attcagcccc tgcatgagcg gatgctaagg ccgggtggtc tcctggcctc gggctgaggc 960
ctcttcccgg ctgtctgccc ctggcctgcg ctggacctgc taagtgccca cagtggcagc 1020
gaggtcccgg tcccggggct ggggtgggag accccgggct gagtgctgtg gctttctggt 1080
ggggggcgat ggaaacagga aaccaagcag tgggatcgca gcgttggtca ctgcgaggcg 1140
agtggcgggc tttctgtttc tgccttgtcc ctccccacgg tacctggttc ccaggtgaaa 1200
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atgaaaggag gggagaagtt gagaacagaa 
gcaagatgct gatgccgaga atgatgattt 
gtcttagatt gtggcaggta ggctttggag 
cgagctacag cagctcctgg ggtggggctg 
agaaccgtcc tcccagtgtg gaaggccctt 
cggcgctggc ttcgtgcctc cacgtgggcc 
ggcaatttac tgtgctgctg agtgtgaggt 
ctctgcggcg tctcttccgg gctgtgggca 
ctgtggccct gcctctgccc agcgagaggc 
tggggtagca ggtcagtcca ggcaggaagc 
agcctgagtg caggagctgc aggacccgcg 
gtcctccctt ctcagcctcg tccaagcacc 
gcagtcagca caggtgcgtg ggggcgtgag 
ctctgtcctt cctgctcttt cttctctgcc 
cacctgggct gcctgggagg cttcctggtg 
cctgtggctg gtctggggtg gggtctgttg 
tccctcgctt ttgggggggg ggtctctcac 
ttccctctat gtcgggcagt gctgagggct 
cacctaaggg ctggcatcca catcatttca 
gcagcacagc cacacccgtt cccacgtcag 
ctctgagccg gggctccttc caggagctga 
ctgggtactc atgccaagca gagactggga 
aacattctcc catgtctctt gtgtggtccc 
gggcaggagg ctccctctgc tgaccccctg 
ggtggagggc aggtgaacac aagctgctgc 
acacaagcgg ctgccgcatg tagccactca 
gggagctctt tgagcctttc tgtctcattt 
ggcctctctg tgtcctctgc agtgggggtt 
ccttctttca tgaagtctgt tttttaagtg 
gggaaaattt atagtggcaa ttattttctc 
gtaagggggc ctagtagagc gtggcgtgtg 
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cattccataa 


aggatatttc 


ctaataggct 


1260 


tctttcctgc 


agatgaaact 


attagaaagg 


1320 


caggcgccga gacatttctg agcatgagga 


1380 


cctgcgggat 


ggcgggagag 


gatgccctgg 


1440 


ttccctgagg 


agtgggcatt 


ctgggccagc 


1500 


agccccagct 


gctccgtgtt 


tcctggcgtt 


1560 


catctccgga 


gcgttttcag 


cagcccctgg 


1620 


tgcagggaag 


tggctctgag 


gcagtctgcg 


1680 


cgtgggctct 


ggacaagccg 


cccttcaggc 


1740 


agcacctgcc 


ccccgcgcca 


gcccagcccc 


1800 


ggggcttttc 


cagctactct 


gttccttcac 


1860 


gggaagacct 


ccaggctgac 


cccttgagca 


1920 


ggaggcaggg 


tcttcaccac 


aggcgccttc 


1980 


caggccgctg cagctgcaca gcctctgcta 


2040 


tggtgtctgg 


accccacggc 


cttgggtcat 


2100 


tggtccttcc 


acggtgtcag 


tggcctgaag 


2160 


ccccaggcca 


catagggcca 


gtggtagggg 


2220 


gggatgctct 


gtgaccccag 


ctggagccca 


2280 


ccctgcagtg agggaagagg 


ccaccaggtg 


2340 


aggagggcaa 


ggctgggtac 


tcagcagcca 


2400 


aatccacctg 


tctccatctt 


ccttgcctgc 


2460 


ttaggggttc 


tgtgctcttg 


cctaattagg 


2520 


agaaggagaa 


gtgagtttgc 


caaggatatg 


2580 


cagcctggag 


ccagcccggg 


gactgtcctg 


2640 


cggggactgt 


cctgggtgga 


cggcaggtga 


2700 


ctcgactttt 


tttcagctgt 


gaccattcct 


2760 


ggaaccaggg 


ggaaccagga 


aggggctcct 


2820 


gtggggggcg 


cagcatccac 


gccttgctgc 


2880 


ctggttcccc 


cgaatatttt 


atgcagagga 


2940 


acagtctggt 


gagcaggcaa 


ttaattagga 


3000 


gcagaatcgc 


accgccccgg 


ctccccagcc 


3060 
caccgccatg cagggctcgc gtgcgggaaa actaatatgc cggcgtttaa gcctgtgccc 3120 
ctctgctggg tgtaactgcg ctgaaataaa tgatctgaca atgtgaaaaa aaaaaaa 3177 

<210> 10 

<211> 1956 

<212> DNA 

<213> Homo sapien 

<400> 10 



ccggctccac ccccaagcca ggcgaggcag gttccgaggt tggaacacct ggcgagtcct 60 

cggtgtcggt ggccggcagt catctcgcgg ccgttcagga taagccagag acatgggaaa 120 

accaatggaa gtgtttcaaa gttcttctct ttaatcactt ctgtatccag ctgcctttga 180 

tttgtggaac ctattatttt acagagtatt tcaatattcc ttatgattgg gaaagaatgc 240 

caagatggta ttttcttttg gcaagatgct ttggttgtgc agtcattgaa gatacttggc 300 

actattttct gcatagactc ttacaccaca aaagaatata caagtatatt cataaagttc 360 

atcatgagtt tcaggctcca tttggaatgg aagctgaata tgcacatcct ttggagactc 420 

taattcttgg aactggattt ttcattggaa tcgtgctttt gtgtgatcat gtaattcttc 480 

tttgggcatg ggtgaccatt cgtttattag aaactattga tgtccatagt ggttatgata 540 

ttcctctcaa ccctttaaat ctgatccctt tctatgctgg ttctcggcat catgatttcc 600 

accacatgaa cttcattgga aactatgctt caacatttac atggtgggat cgaatttttg 660 

gaacagactc tcagtataat gcctataatg aaaagaggaa gaagtttgag aaaaagactg 720 

aataaatatc tcacgtaaac cttcctgaaa gataaacgtt ttcctgaatt cagaaactag 780 

tagctaacat tgcttctgga gagcagaaat aagcatgtct tctggctact aagtgataaa 840 

aagaacatta acaaccttta attaccttcc tagtgggaac tttttctact ttacctacaa 900 

gttctatata tgtagaaatg aataaatata tatttaagta cagttttcat gaggaagttt 960 

taaaagacca tgttcctaag cttccaagaa ggttttggat actagaagta ttaatctatg 1020 

gcttttctcc cagtaaaacc ataggcctga agttcacatt gggtctttaa atcttttaga 1080 

tatatactgg tcatttcaga aaattcttca tagtggtatt ggccttatat ttaacttttt 1140 

ttttattttt tttttgagac aaagccacac tctgtctcct tggctggagt gtggtggcac 1200 

agtctcagct cactgcaacc tctgcctccc agttcaagca attcttctgc ctcagcctcc 1260 

caagtagctg ggattacagg cacccgccac cacgcccagc taatttttgt atttttgtag 1320 

agatggggtt tcacgatgtt ggccaggctg gtctcaaact tctgacctca agtgatctgc 1380 

ccaccttggc ctcccaaagt gctgggatta caggtgtaag ccactgcgcc cggccttttt 1440 

aactttaaac atgttttaga attcacctaa agatcaaaat atcatggatt gaacctcatc 1500 
aattgatagc agtgagtgac tgaagcttcc aaatcaagaa aagccggcac caagaacttc 1560 

cattctaatc tagagctgac cagtttgagc tgattctctc tttgaagagt ccttcttgat 1620 

tgcagtgcag tactggcatt tctgaatgga tgtaagtgga gtattttagt ctaaaggctt 1680 

ttcaaattac ttgaattttt ttaaaaattg aggagcttta tttctattta cccttccatt 1740 

tttgtatatc aaatttccat tgtcattaaa aactgtatct tgaaactttg tgaactgact 1800 

tgctgtattt gcactttgag ctcttgaaat aaatgtgatt tttgtgtgat tatctggttt 1860 

ccagttttaa acattaactg tcacctttta ttcttaaact tgaaagtaca gaaatcatta 1920 

aattattaag ttgtacaata aagggattgg tttctt 1956 

<210> 11 

<211> 1488 

<212> DNA 

<213> Homo sapien 

<400> 11 

gactccaagt cccagctgtg cccgggacgc tgcgcgcccc cgggactcca tttcccagag 60 

tgccccgccc cagcctcccg gccccgcccc ccacctggct ccgcgggcag ccggagcgca 120 

cccggccgga agccgctgtc ggggagccgg cggtggggct ggacgcaggt gcaactgaca 180 

tgggtgaacc ccagggatcc atgcggattc tagtgacagg gggctctggg ctggtaggca 240 

aagccatcca gaaggtggta gcagatggag ctggacttcc tggagaggac tgggtgtttg 300 

tctcctctaa agacgccgat ctcacggata cagcacagac ccgcgccctg tttgagaagg 360 

tccaacccac acacgtcatc catcttgctg caatggtggg gggcctgttc cggaatatca 420 

aatacaattt ggacttctgg aggaaaaacg tgcacatgaa cgacaacgtc ctgcactcgg 480 

ccttcgaggt gggcgcccgc aaggtggtgt cctgcctgtc cacctgtatc ttccctgaca 540 

agacgaccta cccgatagat gagaccatga tccacaatgg gcctccccac aacagcaatt 600 

ttgggtactc gtatgccaag aggatgatcg acgtgcagaa cagggcctac ttccagcagt 660 

acggctgcac cttcaccgct gtcatcccca ccaacgtctt cgggccccac gacaacttca 720 

acatcgagga tggccacgtg ctgcctggcc tcatccacaa ggtgcacctg gccaagagca 780 

gcggctcggc cctgacggtg tggggtacag ggaatccgcg gaggcagttc atatactcgc 840 

tggacctggc ccagctcttt atctgggtcc tgcgggagta caatgaagtg gagcccatca 900 

tcctctccgt gggcgaggaa gatgaggtct ccatcaagga ggcagccgag gcggtggtgg 960 

aggccatgga cttccatggg gaagtcacct ttgatacaac caagtcggat gggcagttta 1020 

agaagacagc cagtaacagc aagctgagga cctacctgcc cgacttccgg ttcacaccct 1080 

tcaagcaggc ggtgaaggag acctgtgctt ggttcactga caactacgag caggcccgga 1140 
agtgaagctg gaagacagga tcaggtgcca gcggaccatc ggctggcaga gcccagcggc 1200 

caccacccgt caaccctgcc aggagctgag ggcaccaccc agcaacctgg gcctgcattc 1260 

catccgctct gcagccccaa gcatctttcc agtggggccc ccattcacgt tggtcctcag 1320 

ggaaaccagg gtccggggca ggcccggcgc tttgctcccc acaccagccc cctgcgcgtg 1380 

tccactctga tcctgcatcc cactccctgg gagccaataa agtgcatttt cacaggcaaa 1440 

aaaggggaag aaaacgacag ggggaggtag aaggagtagg aggagtga 1488 

<210> 12 

<211> 1757 

<212> DNA 

<213> Homo sapien 

<400> 12 

gactccaagt cccagctgtg cccgggacgc tgcgcgcccc cgggactcca tttcccagag 60 

tgccccgccc cagcctcccg gccccgcccc ccacctggct ccgcgggcag ccggagcgca 120 

cccggccgga agccgctgtc ggggagccgg cggtggggct ggacgcaggt gcaactgaca 180 

tgggtgaacc ccagggatcc atgcggattc tagtgacagg gggctctggg ctggtaggca 240 

aagccatcca gaaggtggta gcagatggag ctggacttcc tggagaggac tgggtgtttg 300 

tctcctctaa agacgccgat ctcacggata cagcacagac ccgcgccctg tttgagaagg 360 

tccaacccac acacgtcatc catcttgctg caatggtggg gggcctgttc cggaatatca 420 

aatacaattt ggacttctgg aggaaaaacg tgcacatgaa cgacaacgtc ctgcactcgg 480 

ccttcgaggt gggcgcccgc aaggtggtgt cctgcctgtc cacctgtatc ttccctgaca 540 

agacgaccta cccgatagat gagaccatga tccacaatgg gcctccccac aacagcaatt 600 

ttgggtactc gtatgccaag aggatgatcg acgtgcagaa cagggcctac ttccagcagt 660 

acggctgcac cttcaccgct gtcatcccca ccaacgtctt cgggccccac gacaacttca 720 

acatcgagga tggccacgtg ctgcctggcc tcatccacaa ggtgcacctg gccaagagca 780 

gcggctcggc cctgacggtg tggggtacag ggaatccgcg gaggcagttc atatactcgc 840 

tggacctggc ccagctcttt atctgggtcc tgcgggagta caatgaagtg gagcccatca 900 

tcctctccgt gggcgaggaa gatgaggtct ccatcaagga ggcagccgag gcggtggtgg 960 

aggccatgga cttccatggg gaagtcacct ttgatacaac caagtcggat gggcagttta 1020 

agaagacagc cagtaacagc aagctgagga cctacctgcc cgacttccgg ttcacaccct 1080 

tcaagcaggc ggtgaaggag acctgtgctt ggttcactga caactacgag caggcccgga 1140 

agtgaagctg gaagacagga tcaggtgcca gcggaccatc ggctggcaga gcccagcggc 1200 

caccacccgt caaccctgcc aggagctgag ggcaccaccc agcaacctgg gcctgcattc 1260 

catccgctct gcagccccaa gcatctttcc agtggggccc ccattcacgt tggtcctcag 1320 

ggaaaccagg gtccggggca ggcccggcgc tttgctcccc acaccagccc cctgcgcgtg 1380 

tccactctga tcctgcatcc cactccctgg gagccaataa agtgcatttt cacaggcaaa 1440 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaagggg gggggcgcca 1500 

aaagattccc cctcgggggg ggccgccaca atattatgcc gcagcagccc acggtgtttc 1560 

tgtgtgtgaa aaagaagggg gggccccaat aaaggggggc gcgtaatata tgaaccccgc 1620 

acgagatggg cgcccggctt taaaaaaacg cggcggtgga gtgggcgaaa aaacgcgtga 1680 

taccggggca aacctgggga gaagagacac ccccaaacac ggaggggggg agaccaatcg 1740 

ggggcaacac actacct 1757 

<210> 13 

<211> 1869 

<212> DNA 

<213> Homo sapien 

<400> 13 

gggagccgcg cgacgcctgc gcagggcccc ctcccctaag ggctggccct ggggccctgc 60 

cgggggagcc cgagcaggtt tccggctccc aggatgggat ccttgagggt ggctgcgccc 120 

cgaggcctgt cctgccctgg aggaatctgg ctcgccaggg ctctccagga ccagccagcc 180 

tcgagtgggc gtgggctgca cacaggcgcc gtctcgtgcc tgtccctagc atgcgggctt 240 

tagctgcgaa ccgtgttaac gacctgtgcc aggagccccc gtcccaaggt tgtctgcctc 300 

ccctcttgtc agtcagaggg gagtagaatg taccttttct cggcctagtg gggagtcgtg 360 

ggttgggacc agctgctctg ggctgggtgg tagctcagga cccctgcggc gctgtaggct 420 

gagagctccc ggggstacag ggctgcgacg agggagtgcc tcggtccagc tgggtctttc 480 

tggttgccag tggactatgc cccactctga gggcctcact ctgtgccagt tgccacagaa 540 

gagtggggct ccaaaggatg agagcgtgat gacctcagcc tcccactgtt tgacattaca 600 

gtgcaactga catgggtgaa ccccagggat ccatgcggat tctagtgaca gggggctctg 660 

ggctggtagg caaagccatc cagaaggtgg tagcagatgg agctggactt cctggagagg 720 

actgggtgtt tgtctcctct aaagacgccg atctcacgga tacagcacag acccgcgccc 780 

tgtttgagaa ggtccaaccc acacacgtca tccatcttgc tgcaatggtg gggggcctgt 840 

tccggaatat caaatacaat ttggacttct ggaggaaaaa cgtgcacatg aacgacaacg 900 

tcctgcactc ggccttcgag gtgggcgccc gcaaggtggt gtcctgcctg tccacctgta 960 

tcttccctga caagacgacc tacccgatag atgagaccat gatccacaat gggcctcccc 1020 

acaacagcaa ttttgggtac tcgtatgcca agaggatgat cgacgtgcag aacagggcct 1080 






acttccagca gtacggctgc accttcaccg ctgtcatccc caccaacgtc ttcgggcccc 1140 

acgacaactt caacatcgag gatggccacg tgctgcctgg cctcatccac aaggtgcacc 1200 

tggccaagag cagcggctcg gccctgacgg tgtggggtac agggaatccg cggaggcagt 1260 

tcatatactc gctggacctg gcccagctct ttatctgggt cctgcgggag tacaatgaag 1320 

tggagcccat catcctctcc gtgggcgagg aagatgaggt ctccatcaag gaggcagccg 1380 

aggcggtggt ggaggccatg gacttccatg gggaagtcac ctttgataca accaagtcgg 1440 

atgggcagtt taagaagaca gccagtaaca gcaagctgag gacctacctg cccgacttcc 1500 

ggttcacacc cttcaagcag gcggtgaagg agacctgtgc ttggttcact gacaactacg 1560 

agcaggcccg gaagtgaagc tggaagacag gatcaggtgc cagcggacca tcggctggca 1620 

gagcccagcg gccaccaccc gtcaaccctg ccaggagctg agggcaccac ccagcaacct 1680 

gggcctgcat tccatccgct ctgcagcccc aagcatcttt ccagtggggc ccccattcac 1740 

gttggtcctc agggaaacca gggtccgggg caggcccggc gctttgctcc ccacaccagc 1800 

cccctgcgcg tgtccactct gatcctgcat cccactccct gggagccaat aaagtgcatt 1860 

ttcacaggc 1869 


<210> 14 
<211> 1602 
<212> DNA 
<213> Homo sapien 












<400> 14 
gactccaagt cccagctgtg cccgggacgc tgcgcgcccc cgggactcca tttcccagag 60 

tgccccgccc cagcctcccg gccccgcccc ccacctggct ccgcgggcag ccggagcgca 120 

cccggccgga agccgctgtc ggggagccgg cggtggggct ggacgcaggt gcaactgaca 180 

tgggtgaacc ccagggatcc atgcggattc tagtgacagg gggctctggg ctggtaggca 240 

aagccatcca gaaggtggta gcagatggag ctggacttcc tggagaggac tgggtgtttg 300 

tctcctctaa agacgccgat ctcacggata cagcacagac ccgcgccctg tttgagaagg 360 

tccaacccac acacgtcatc catcttgctg caatggtggg gggcctgttc cggaatatca 420 

aatacaattt ggacttctgg aggaaaaacg tgcacatgaa cgacaacgtc ctgcactcgg 480 

ccttcgaggt gggcgcccgc aaggtggtgt cctgcctgtc cacctgtatc ttccctgaca 540 

agacgaccta cccgatagat gagaccatga tccacaatgg gcctccccac aacagcaatt 600 

ttgggtactc gtatgccaag aggatgatcg acgtgcagaa caggtcctcg cgcccatcct 660 

gctgaggccc agccaggagt cggggagagg gcccccaggg gcgggtgggg ccggctggtc 720 

gggtcttcgt gccagcctca gtaagaggcc agcggcagcc tggggtggga tcgcgggccc 780 
aggacgaggg caggagacgc tgtgcctcgg cctacttcca gcagtacggc tgcaccttca 840 

ccgctgtcat ccccaccaac gtcttcgggc cccacgacaa cttcaacatc gaggatggcc 900 

acgtgctgcc tggcctcatc cacaaggtgc acctggccaa gagcagcggc tcggccctga 960 

cggtgtgggg tacagggaat ccgcggaggc agttcatata ctcgctggac ctggcccagc 1020 

tctttatctg ggtcctgcgg gagtacaatg aagtggagcc catcatcctc tccgtgggcg 1080 

aggaagatga ggtctccatc aaggaggcag ccgaggcggt ggtggaggcc atggacttcc 1140 

atggggaagt cacctttgat acaaccaagt cggatgggca gtttaagaag acagccagta 1200 

acagcaagct gaggacctac ctgcccgact tccggttcac acccttcaag caggcggtga 1260 

aggagacctg tgcttggttc actgacaact acgagcaggc ccggaagtga agctggaaga 1320 

caggatcagg tgccagcgga ccatcggctg gcagagccca gcggccacca cccgtcaacc 1380 

ctgccaggag ctgagggcac cacccagcaa cctgggcctg cattccatcc gctctgcagc 1440 

cccaagcatc tttccagtgg ggcccccatt cacgttggtc ctcagggaaa ccagggtccg 1500 

gggcaggccc ggcgctttgc tccccacacc agccccctgc gcgtgtccac tctgatcctg 1560 

catcccactc cctgggagcc aataaagtgc attttcacag gc 1602 

<210> 15 

<211> 1693 

<212> DNA 

<213> Homo sapien 

<400> 15 

gactccaagt cccagctgtg cccgggacgc tgcgcgcccc cgggactcca tttcccagag 60 

tgccccgccc cagcctcccg gccccgcccc ccacctggct ccgcgggcag ccggagcgca 120 

cccggccgga agccgctgtc ggggagccgg cggtggggct ggacgcaggt gcaactgaca 180 

tgggtgaacc ccagggatcc atgcggattc tagtgacagg gggctctggg ctggtaggca 240 

aagccatcca gaaggtggta gcagatggag ctggacttcc tggagaggac tgggtgtttg 300 

tctcctctaa agacgccgat ctcacggata cagcacagac ccgcgccctg tttgagaagg 360 

tccaacccac acacgtcatc catcttgctg caatggtggg gggcctgttc cggaatatca 420 

aatacaattt ggacttctgg aggaaaaacg tgcacatgaa cgacaacgtc ctgcactcgg 480 

ccttcgaggt gggcgcccgc aaggtggtgt cctgcctgtc cacctgtatc ttccctgaca 540 

agacgaccta cccgatagat gagaccatga tccacaatgg gcctccccac aacagcaatt 600 

ttgggtactc gtatgccaag aggatgatcg acgtgcagaa caggtcctcg cgcccatcct 660 

gctgaggccc agccaggagt cggggagagg gcccccaggg gcgggtgggg ccggctggtc 720 

gggtcttcgt gccagcctca gtaagaggcc agcggcagcc tggggtggga tcgcgggccc 780 
aggacgaggg caggagacgc tgtgcctcgg cctacttcca gcagtacggc tgcaccttca 840 

ccgctgtcat ccccaccaac gtcttcgggc cccacgacaa cttcaacatc gaggatggcc 900 

acgtgctgcc tggcctcatc cacaaggtgc acctggccaa gagtgagtgc gtccgtccga 960 

tggggtgccc gggctcagct cgggaggagg tgacccagtg cctctgacac ctgctccttc 1020 

cccaaatctg caggcagcgg ctcggccctg acggtgtggg gtacagggaa tccgcggagg 1080 

cagttcatat actcgctgga cctggcccag ctctttatct gggtcctgcg ggagtacaat 1140 

gaagtggagc ccatcatcct ctccgtgggc gaggaagatg aggtctccat caaggaggca 1200 

gccgaggcgg tggtggaggc catggacttc catggggaag tcacctttga tacaaccaag 1260 

tcggatgggc agtttaagaa gacagccagt aacagcaagc tgaggaccta cctgcccgac 1320 

ttccggttca cacccttcaa gcaggcggtg aaggagacct gtgcttggtt cactgacaac 1380 

tacgagcagg cccggaagtg aagctggaag acaggatcag gtgccagcgg accatcggct 1440 

ggcagagccc agcggccacc acccgtcaac cctgccagga gctgagggca ccacccagca 1500 

acctgggcct gcattccatc cgctctgcag ccccaagcat ctttccagtg gggcccccat 1560 

tcacgttggt cctcagggaa accagggtcc ggggcaggcc cggcgctttg ctccccacac 1620 

cagccccctg cgcgtgtcca ctctgatcct gcatcccact ccctgggagc caataaagtg 1680 

cattttcaca ggc 1693 


<210> 16 

<211> 2040 

<212> DNA 

<213> Homo sapien 












<400> 16 
gactccaagt cccagctgtg cccgggacgc tgcgcgcccc cgggactcca tttcccagag 60 

tgccccgccc cagcctcccg gccccgcccc ccacctggct ccgcgggcag ccggagcgca 120 

cccggccgga agccgctgtc ggggagccgg cggtggggct ggacgcaggt gcaactgaca 180 

tgggtgaacc ccagggatcc atgcggattc tagtgacagg gggctctggg ctggtaggca 240 

aagccatcca gaaggtggta gcagatggag ctggacttcc tggagaggac tgggtgtttg 300 

tctcctctaa agacgccgat ctcacggata cagcacagac ccgcgccctg tttgagaagg 360 

tccaacccac acacgtcatc catcttgctg caatggtggg gggcctgttc cggaatatca 420 

aatacaattt ggacttctgg aggaaaaacg tgcacatgaa cgacaacgtc ctgcactcgg 480 

ccttcgaggt gggcgcccgc aaggtggtgt cctgcctgtc cacctgtatc ttccctgaca 540 

agacgaccta cccgatagat gagaccatga tccacaatgg gcctccccac aacagcaatt 600 

ttgggtactc gtatgccaag aggatgatcg acgtgcagaa cagggcctac ttccagcagt 660 

acggctgcac cttcaccgct gtcatcccca ccaacgtctt cgggccccac gacaacttca 720 

acatcgagga tggccacgtg ctgcctggcc tcatccacaa ggtgcacctg gccaagagca 780 

gcggctcggc cctgacggtg tggggtacag ggaatccgcg gaggcagttc atatactcgc 840 

tggacctggc ccagctcttt atctgggtcc tgcgggagta caatgaagtg gagcccatca 900 

tcctctccgg tgggtacctc agccctcagc ccccttcctc catggtagga caggatccaa 960 

ggctgtcctg ggaggctggc tgaggggtgg gagcaggaat attcatgggc cgtcgggtgg 1020 

caccaagggg aggcctgacc cggtggctcc ccacatgggc gaggaagatg aggtctccat 1080 

caaggaggca gccgaggcgg tggtggaggc catggacttc catggggaag tcaccgtatc 1140 

ctctggctcc agtgctgggg gttgggcggg ggccccttgg gcgggctcag tggccacccg 1200 

gtgtagggcc agcgtgggtg ggacactgcg gagggaggat gctcaccctg gcaggcggag 1260 

gagctctgct gtggctcccc tgggaatggc agaggttcaa gggcaagctg cgggggctgg 1320 

ggtggaggtg ctcttctcgc tgtggccttg acaaagcctc catttgatac aaccaagtcg 1380 

gatgggcagt ttaagaagac agccagtaac agcaagctga ggacctacct gcccgacttc 1440 

cggttcacac ccttcaagca gggtgagccc tgaccccaca gccctccact tggtggaggc 1500 

ctgccccagc ccctccctca tgctcgctac cagggcagag gtgaggtgcc ccctcagagc 1560 

ctgtactgcc ctggaggtgg gacaccaaag ctctgagggg tgacttccct gtccctgggc 1620 

ctcctccccc cacttcctcc catgcggccc cgaccctccc tgtccatggc cctcacctgc 1680 

ctgcctctgc agcggtgaag gagacctgtg cttggttcac tgacaactac gagcaggccc 1740 

ggaagtgaag ctggaagaca ggatcaggtg ccagcggacc atcggctggc agagcccagc 1800 

ggccaccacc cgtcaaccct gccaggagct gagggcacca cccagcaacc tgggcctgca 1860 

ttccatccgc tctgcagccc caagcatctt tccagtgggg cccccattca cgttggtcct 1920 

cagggaaacc agggtccggg gcaggcccgg cgctttgctc cccacaccag ccccctgcgc 1980 

gtgtccactc tgatcctgca tcccactccc tgggagccaa taaagtgcat tttcacaggc 2040 


<210> 17 

<211> 2481 

<212> DNA 

<213> Homo sapien 












<400> 17 
cgcccagcgc agccccggcc gctgggcgca cccgtcccgt tcgtccccgg acgttgctct 60 

ctaccccggg aacgtcgaga ctggagcgcc cgaactgagc caccttcgcg gaccccgaga 120 

gcggcggcgc gactcaccgc ggaggcgccc ggacgcgtca agtgcgtact ttcaaccgta 180 

aatgggtgct gagcgtttag gggtgcgggg caactcccta actcgtgggg ccctggccgg 240 

ccctgccgcg gcgtccgaga ccttctgatc cctcgctgcg cgcggcgcgg ggttggcgac 300 

ttgggctcag gggcggcggg gcggggaggt gccttgacct agggacccag aggccgggag 360 

caagtctcct ttcgccttgg actgcagtgt cccccagggc aaggcctgaa ccaggtcgcg 420 

agggtctgtc tcccgtctgg ggagctgaac tgcccgagac actgtcgcag tcgcggggac 480 

ttgtctggag cggcggggtc tgagcccaag tcggaagccg ccgccgccgc cagcgtcccg 540 

tgcgcctgcc gccccgggac cagccgcgaa acttcttctc gggacggaga cagcgctcgc 600 

cgcccgggcc gaggtaggag ctggctgggc tcggtagagc gcgggcgaat cgctttcggc 660 

agcgaaggaa cgtgtcggga gaagccaact gcaggaccag gccccagcgg gcgcgcagag 720 

acgcctgcct tgggcggagg tccccccctc cggttttcgt ggctctggta gggccctctg 780 

tcccgtgagc tcctcactgg acctgaggct acaggcttca gcaggtcaag cgcccgagtg 840 

tgcctggagg attggcggcg gggtggcatg agaaggtggc ccctcttctc catctcccag 900 

tcaggcgatt tctctgctgt ctggggcagg ggcgcacacc ccaggaactt actacgtaga 960 

gggaagccag cctggtctct cccccttgcg tcagtgctcc taagctgccc ctcaccactc 1020 

tgggtagggg tctttcaggt gggctaggga attctccttg gaagtcccct tgtttgctcc 1080 

cctccatcag ggcacctctg agttagccaa tcatcctctt ttctggcctc ttttttgcgg 1140 

ggaggagagt agagagagaa gcctcctact cggcaggtct ctgagaggag gtttgggggg 1200 

gcttctgagg ttgctcctcc tgggcagagt caaggggtgt tgagggttcc aacttttact 1260 

ggacaatctc tcaagctctg attccctgat ttctgggagg tgtcagtgta gggcttgtgt 1320 

cctgccccac ccacacagca tttggtgcag gcacatggga tggttcccca agcccttttc 1380 

ccacagttag tttcagaaaa tgcataagcc acctaaggct gggagagcct ttctgtgctg 1440 

gctgagatct ggtggcatgt gcatgcatat gtgtgcaagt gtgtgtgtgt gtgtgcaagt 1500 

gtgtgtgcgc atgtgcatgt gtgtggaaag tatggtctgg gtggctgtaa gactggcccc 1560 

tcttgtcccc tctgcaggcc cagcccaata ggaagcacct gatccacact ctttctagcc 1620 

ccagggatgc tcaggaaagc agcctttctt tagggctgtc atctgttctg gggatctgtt 1680 

gctgtttctt gggcccacag aatacactcc acacctcggg actccataat attggacaag 1740 

gctggaaggc ccatttggct cctgcagcct cttatgcagg tgaggacaga cccaaggtcc 1800 

aggcaatgct ggcctttgga acacagaacc tggctcactg actcccactc cagttgtctt 1860 

ttccctctgc ctctggagca accctctctc ctccagagca acccctgtcc ctccttccta 1920 

cctctctcta gggctgcacc cccagctcac cttagacctg gcccttccta cctgttgccc 1980 

ctactgtctt gtcccatccc agtggtcagg agggagagta ctggccaaag gcccagctcc 2040 
acctgtgact taggggagtg ccaggcatca cccagaggtc ccggccctcg ggggcctggc 2100 

aggctctgct gtgggggaag cagagttaga actggggctg cttctccact ggccgtttgt 2160 

ctctgtcccc tccactggcc tcttgaagca cagagaccct ccgggtactt tccttcctct 2220 

gggcttccac tcatgctatt ccctgctccc tttttctacc tggagacccc cattcctagt 2280 

catcctcttc agcggtcctc ccagagctgt cctcagcatg gcagcctcca cagcccctgg 2340 

gtgtcacccc cggtttcgtg tctgcctcgg actccagacc tgcccctccc aggctggccc 2400 

agatggatcc tctacagtga ctaacccaaa ccagcaccag gcagggaccc aataaacctt 2460 

tgtgaatgga aaaaaaaaaa a 2481 

<210> 18 
<211> 2127 
<212> DNA 
<213> Homo sapien 

<400> 18 

aaagggttcg ccgatcttgg ggagccacac agcccgggtc gcaggcacct ccccgccagc 60 

tctcccgctt ctcgcacagc ttcccgacgc gtctgctgag ccccatggcc cacgccacgc 120 

tctccgccgc ccccagcaat ccccggctcc tgcgggtggc gctgctgctc ctgctcctgg 180 

tggccgccag ccggcgcgca gcaggtgggt cccggcgccc tggggtcccc gggccggacg 240 

cggctggggt gggcgccccg cgccgacagc cccgctcaat cagcgagtct attcttccct 300 

aggagcgtcc gtggtcactg aactgcgctg ccagtgcttg cagacactgc agggaattca 360 

cctcaagaac atccaaagtg tgaatgtaag gtcccccgga ccccactgcg cccaaaccga 420 

agtcatgtaa gtcccgcccc gcgcttccgc tgccaccgct ggggtccccg actctcccgc 480 

tgccccaacc ctgtccccag cccgacctcc tgtctcacgg aattcccttc tctctgcaga 540 

gccacactca agaatgggaa gaaagcttgt ctcaaccccg catcccccat ggttcagaaa 600 

atcatcgaaa agatactgaa caagtgagtt gtaatttcca tatacacagg cgactggagc 660 

cgttggtcag aaatactggc atctgccccc taaaaataaa atcagggaac cccaagagtt 720 

agctgaagga ctagaaattg tgattattat tttcacaatt aaagttgcca ttaaggttat 780 

taatctgctc tggtgccaga ggatatttca gtatttcagt gtcttccatt tccaggccta 840 

acccctggga gggtaaatgt gggtttctag aatactaagt aaatttgact aacaatgcca 900 

aaatgccccc tgcctgtttt gggacgttta cctcagtccc taatgggctt cagacctgaa 960 

aaatgcaatg tctggtttgg gcactttggg cctgtaagcc ttcttggtga agagcagaaa 1020 

ctttgcagta acaccttcag tgagttcaag gctaggatcc ctgtcaacat tatttaatct 1080 

ctttgagctt tagcaaaata gggtagataa tggtagttat ttcatagggt tgtgtgcagc 1140 
ttaagggaga tcatgaatgg acatttctaa cccaatgtga caatcagtga ccaatatgtt 1200 

ttggcttttt caaaccaaga atctagttga gtccacagcc acttctttct tgaaaaaaaa 1260 

aaaaaaaagc attcaggagt gcaagttttc tgctatagaa ggcagtgggt acagacagac 1320 

cttccctgag tggaggtttg ggagagagtc tgcatgggga catccctcca agcaacttca 1380 

gagtgacaga agagcagcct tccttagcac ctgcaacagt aacccttttc tcatcacagg 1440 

gggagcacca actgacagga gagaagtaag aagcttatca gcgtatcatt gacacttcct 1500 

gcagggtggt ccctgccctt accagagctg aaaatgaaaa agagaacagc agctttctag 1560 

ggacagctgg aaaggactta atgtgtttga ctatttctta cgagggttct acttatttat 1620 

gtatttattt ttgaaagctt gtattttaat attttacatg ctgttattta aagatgtgag 1680 

tgtgtttcat caaacatagc tcagtcctga ttatttaatt ggaatatgat gggttttaaa 1740 

tgtgtcatta aactaatatt tagtgggaga ccataatgtg tcagccacct tgataaatga 1800 

cagggtgggg aactggaggg tggggggatt gaaatgcaag caattagtgg atcactgtta 1860 

gggtaaggga atgtatgtac acatctattt tttatacttt ttttttaaaa aaagaatgtc 1920 

agttgttatt tattcaaatt atctcacatt atgtgttcaa catttttatg ctgaagtttc 1980 

ccttagacat tttatgtctt gcttgtaggg cataatgcct tgtttaatgt ccattctgca 2040 

gcgtttctct ttcccttgga aaagagaatt tatcattact gttacatttg tacaaatgac 2100 

atgataataa aagttttatg aaaaaaa 2127 

<210> 19 

<211> 4323 

<212> DNA 

<213> Homo sapien 

<400> 19 

gggacgtccg tggtccttgt cgcacgtcgc agcgcctggc gcccgggaag aggtggttgt 60 

gaggcagacg aactcgcggc tctccggctt ccgaggcttc cgagttgtcg gaggaagggg 120 

gcggcgagca ataagaaccc gccgcacccg gtcctcagcg actcttctga cctccgcgcg 180 

acgtacccgc cgccgccgtt ggctggagca tttgacattg tgcagcaaag aaatggttat 240 

ggagaagccc agtccgctgc ttgtagggcg ggagtttgtg aggcaatatt atactttgct 300 

gaataaagct ccggaatatt tacacaggtt ttatggcagg aattcttcct atgttcatgg 360 

tggagtagat gctagtggaa agccccagga agctgtttat ggccaaaatg atatacacca 420 

caaagtatta tctctgaact tcagtgaatg tcatactaaa attcgtcatg tggatgctca 480 

tgcaaccttg agtgatggag tagttgtcca ggtcatgggt ttgctgtcta acagtggaca 540 

accagaaaga aagtttatgc aaacctttgt tctggctcct gaaggatctg ttccaaataa 600 
attttatgtt cacaatgata tgtttcgtta tgaagatgaa gtgttatggt gattctgagc 660 

ctgaacttga tgaagaatca gaagatgaag tagaagagga acaagaagaa agacaaccat 720 

ctcctgaacc tgtgcaagaa aatgctaaca gtggttacta tgaagctcac cctgtgacta 780 

atggcataga ggagcctttg gaagaatcct ctcatgaacc tgaacctgag ccagaatctg 840 

aaacaaagac tgaagagctg aaaccacaag tggaggagaa gaacttagaa gaactagagg 900 

agaaatctac tactcctcct ccggcagaac ctgtttctct gccacaagaa ccaccaaagg 960 

ctttctcctg ggcttcagtg accagtaaaa acctgcctcc tagtggtact gtttcttcct 1020 

ctggaattcc accccatgtt aaagcaccag tctcacagcc aagagtcgaa gctaaaccag 1080 

aagttcaatc tcagccacct cgtgtgcgtg aacaacgacc tagagaacga cctggttttc 1140 

ctcctagagg accaagacca ggcagaggag atatggaaca gaatgactct gacaaccgta 1200 

gaataattcg ctatccagat agtcatcaac tttttgttgg taacttgcca catgatattg 1260 

atgaaaatga gctaaaggaa ttcttcatga gttttggaaa cgttgtggaa cttcgcatca 1320 

ataccaaggg tgttggggga aagcttccaa attttggttt tgtggttttt gatgactctg 1380 

aaccagttca gagaatctta attgcaaaac cgattatgtt tcgaggggaa gtacgtttaa 1440 

atgtggaaga gaaaaaaaca agagctgcaa gagagcgaga aaccagaggt ggtggtgatg 1500 

atcgcaggga tattaggcgc aatgatcgag gtcccggtgg tccacgtgga attgtgggtg 1560 

gtggaatgat gcgtgatcgt gatggaagag gacctcctcc aaggggtggc atggcacaga 1620 

aacttggctc tggaagagga accgggcaaa tggagggccg cttcacagga cagcgtcgct 1680 

gaagctccac tgttggcaaa gtcttggcag tggtacatta ttcatcgtgt ttgcattctt 1740 

gttaattttt tttttggctt tggaatgtga cacagccttt ttgatcattt ctttgatgtg 1800 

aaaagcatct ttggttatca gttaaattga ggtggacatt atttccccaa tttcacaaca 1860 

ggattcacat tgttaattta taaatctaga cttggagaat taaggactga gaaatgacca 1920 

tatcttaaac tatctacgac aaagtgaact taaaaggaca tgcccactga attcaggtcc 1980 

tttgagtaaa aaaaaaatct tctgctgcac attttgttta agtgttactg tttctgcctg 2040 

ttaatgctgg gaacacaaat agtgcaattt gtgcaattgg agaatcttgc cttttttctt 2100 

ggctcccccc aaaaatacaa accaacagaa acttgttatg cactcatcaa aatgtactaa 2160 

tgggtactct gaactcatta acattgacat ctgcaacagg aggcaacagg gaaaaaatct 2220 

catcttcttt tccagtagaa aatagtttgt gaaatgatga gggcatttta tctgcttgct 2280 
gtgaccagcg tgtgtacaca taaaccttaa caagactaca agtatattcc agaaggaaat 2340 
cattttagtt atgaactaaa taataaaaat tagaacttca aatgcgatgg tcttgactat 2400 
tagaccagat ttagtagctc catatctaag atttttctac ctgcccctct tcagtacagg 2460 
gatggctggc tgctcaacac actcctcctc cccttttttc ctttctttaa gctgtgtaca 2520 

gtgaaaattg tctttactgt atttttgttc tctggtaatg taataagcat gatggtgcct 2580 

tctattaata catcattcca gtcttgctgg taattttgta cagtatagtg tatgaattgc 2640 

tgtgctgcaa agccaaacag ctgcaaaatg ttgaaaaatc atcgaaatgt ataaaaattg 2700 

cagtatcttt aaaatcagta aaatggacta gcatattatt tatcttgttc ttcagttaac 2760 

aactttgtgt tctctgtggg agggagggag tcctgtgtgt ttgtggggag agggaaggag 2820 

gaagtcagtt atttgagtaa gcctctagtt gacttttctc ttagcctgaa tgtggacgtt 2880 

gaaacatatc acttcagggc ttggaaaagt cagtcaactt gacgtacatt tttagtgaca 2940 

ttttaaaagc agtcagattc tataaatggc aagtaagcct gaagtgagga tactgcaatt 3000 

ttcggagaaa agaacagcag ctctttaagt gtttgcattt tctatttggg gggcagggaa 3060 

ctgtcattca ttttgcacaa ttcttgaact gatgtcagca cccgagtggc tcctgaattt 3120 

aagtctggga cgacatcttt tatttttaca tgaatcttta aacaattctg tgagcaaagt 3180 

ttgtagctgc tggattattg tctgtcttta tagcaagttc cagtaaacca caagtatggc 3240 

aaagcttatc caattttatg cttggagcag tcagtacata ccagtttctg atgtttcagg 3300 

caggagtggg gtaaataagt gtgaccactt aaagctgctc gttagcatgg aagacttctc 3360 

cattctatct ttgtaaaaca gacaagatat gcacttgaca tagtagcaaa ttggttctga 3420 

attatgcaac tgtttgctat ttagtaaact agcaaatgat gcatgtattt tgtttttcat 3480 

gtactgggca atatgagtaa aatctgtccc tttttccccc tttgaatgag gtcttccatg 3540 

tttgagggaa agtcttgcac tattgcatat attttgggga cacagatttt catagtttcc 3600 

atttttgggg ggcttaagga tttttttttt ttctgtttga aacagtttta tactttctga 3660 

tatagtactt gaaattctta ccagaaaatt actttggagt tttgaagcct ttattaatac 3720 

tacttttaaa gaagcagttg ttttattgtc aatgtttttt ttcccccaag catattttct 3780 

tgtatttctg tttccatata tatatatata tataatttcc aattcaggat attgccctgc 3840 

catccatgaa aactgttctg gcaccaaaag taatgacaaa tgttaagtgt aataatagaa 3900 

aagtagagca aagagccatt cagcttcagt ctttacatac catgaataaa acattaaaac 3960 

atcatatgga gaagtttaca tggtgattgt tcacctgcag tactgtggag ttttaacatt 4020 

ttgtcctctt ttcagtgaaa cagagtaaaa atattcatct accattactg ttatttgctg 4080 

attttgtttt attttttgat ggtaatattc tatccttatg acactattgc aaccaaattg 4140 

gctttaccat cttggcttta gtaggtatag aagacaatgg attaccatct ttattgctgt 4200 

aatgtgttaa gcattatatg ctagtagaat ctagtttaat tgtttcaggt ggaaagtatt 4260 

ctttgagttt ccatattgaa tgtgtttgga ctaaacaaac aataaactac tgatgtctgc 4320 

agc 4323 


<210> 20 

<211> 3590 

<212> DNA 

<213> Homo sapien 












<400> 20 
gactgggagg 


mamagagagt 


cagcttctgc 


tggcttccca 


atcagaagat 


gaagtagaag 


60 


aggaacaaga 


agaaagacaa 


ccatctcctg 


aacctgtgca 


agaaaatget 


aacagtggtt 


120 


actatgaagc 


tcaccctgtg 


actaatggca 


tagaggagee 


tttggaagaa 


tcctctcatg 


180 


aacctgaacc 


tgagccagaa 


tctgaaacaa 


agactgaaga 


gctgaaacca 


caagtggagg 


240 


agaagaactt 


agaagaacta 


gaggagaaat 


ctactactcc 


tcctccggca 


gaacctgttt 


300 


ctctgccaca 


agaaccacca 


aagccaagag


tcgaagctaa 


accagaagtt 


caatctcagc 


360 


cacctcgtgt 


gcgtgaacaa 


cgacctagag 


aacgacctgg 


ttttcctcct 


agaggaccaa 


420 


gaccaggcag 


aggagatatg 


gaacagaatg 


actctgacaa 


ccgtagaata


attcgctatc


480 


cagatagtca 


tcaacttttt 


gttggtaact 


tgccacatga 


tattgatgaa 


aatgagctaa 


540 


aggaattctt 


catgagtttt 


ggaaacgttg 


tggaacttcg 


catcaatacc 


aagggtgttg 


600 


ggggaaagct 


tccaaatttt 


ggttttgtgg 


tttttgatga 


ctctgaacca 


gttcagagaa 


660 


tcttaattgc


aaaaccgatt 


atgtttcgag 


gggaagtacg 


tttaaatgtg 


gaagagaaaa 


720 


aaacaagagc 


tgcaagagag 


cgagaaacca 


gaggtggtgg 


tgatgatcgc


agggatatta 


780 


ggcgcaatga 


tcgaggtccc 


ggtggtccac 


gtggaattgt 


gggtggtgga 


atgatgcgtg 


840 


atcgtgatgg 


aagaggacct 


cctccaaggg 


gtggcatggc 


acagaaactt 


ggctctggaa 


900 






ggccgcttca 


caggacagcg 


tcgctgaagc


tccactgttg 


960 


gcaaagtctt 


ggcagtggta 


cattattcat


cgtgtttgca 


ttcttgttaa 


tttttttttt 


1020 


ggctttggaa 


tgtgacacag 


cctttttgat 


catttctttg 


atgtgaaaag 


catctttggt 


1080 


tatcagttaa 


attgaggtgg 


acattatttc 


cccaatttca 


caacaggatt 


cacattgtta 


1140 


atttataaat 


ctagacttgg 


agaattaagg 


actgagaaat 


gaccatatct 


taaactatct 


1200 


acgacaaagt 


gaacttaaaa 


ggacatgccc 


actgaattca 


ggtcctttga 


gtaaaaaaaa 


1260 


aatcttctgc 


tgcacatttt 


gtttaagtgt 


tactgtttct 


gcctgttaat 


gctgggaaca 


1320 


caaatagtgc 


aatttgtgca 


attggagaat 


cttgcctttt


ttcttggctc 


cccccaaaaa 


1380 


tacaaaccaa 


cagaaacttg 


ttatgcactc


atcaaaatgt 


actaatgggt 


actctgaact 


1440 


cattaacatt 


gacatctgca


acaggaggca 


acagggaaaa 


aatctcatct 


tcttttccag 


1500 
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tagaaaatag tttgtgaaat gatgagggca ttttatctgc ttgctgtgac cagcgtgtgt 1560 

acacataaac cttaacaaga ctacaagtat attccagaag gaaatcattt tagttatgaa 1620 

ctaaataata aaaattagaa cttcaaatgc gatggtcttg actattagac cagatttagt 1680 

agctccatat ctaagatttt tctacctgcc cctcttcagt acagggatgg ctggctgctc 1740 

aacacactcc tcctcccctt ttttcctttc tttaagctgt gtacagtgaa aattgtcttt 1800 

actgtatttt tgttctctgg taatgtaata agcatgatgg tgccttctat taatacatca 1860 

ttccagtctt gctggtaatt ttgtacagta tagtgtatga attgctgtgc tgcaaagcca 1920 

aacagctgca aaatgttgaa aaatcatcga aatgtataaa aattgcagta tctttaaaat 1980

cagtaaaatg gactagcata ttatttatct tgttcttcag ttaacaactt tgtgttctct 2040 

gtgggaggga gggagtcctg tgtgtttgtg gggagaggga aggaggaagt cagttatttg 2100 

agtaagcctc tagttgactt ttctcttagc ctgaatgtgg acgttgaaac atatcacttc 2160 

agggcttgga aaagtcagtc aacttgacgt acatttttag tgacatttta aaagcagtca 2220 

gattctataa atggcaagta agcctgaagt gaggatactg caattttcgg agaaaagaac 2280 

agcagctctt taagtgtttg cattttctat ttggggggca gggaactgtc attcattttg 2340 

cacaattctt gaactgatgt cagcacccga gtggctcctg aatttaagtc tgggacgaca 2400 

tcttttattt ttacatgaat ctttaaacaa ttctgtgagc aaagtttgta gctgctggat 2460

tattgtctgt ctttatagca agttccagta aaccacaagt atggcaaagc ttatccaatt 2520 

ttatgcttgg agcagtcagt acataccagt ttctgatgtt tcaggcagga gtggggtaaa 2580 

taagtgtgac cacttaaagc tgctcgttag catggaagac ttctccattc tatctttgta 2640 

aaacagacaa gatatgcact tgacatagta gcaaattggt tctgaattat gcaactgttt 2700 

gctatttagt aaactagcaa atgatgcatg tattttgttt ttcatgtact gggcaatatg 2760 

agtaaaatct gtcccttttt ccccctttga atgaggtctt ccatgtttga gggaaagtct 2820 

tgcactattg catatatttt ggggacacag attttcatag tttccatttt tggggggctt 2880 

aaggattttt tttttttctg tttgaaacag ttttatactt tctgatatag tacttgaaat 2940

tcttaccaga aaattacttt ggagttttga agcctttatt aatactactt ttaaagaagc 3000

agttgtttta ttgtcaatgt tttttttccc ccaagcatat tttcttgtat ttctgtttcc 3060 

atatatatat atatatataa atttccaatt caggatattg ccctgccatc catgaaaact 3120 

gttctggcac caaaagtaat gacaaatgtt aagtgtaata atagaaaagt agagcaaaga 3180 

gccattcagc ttcagtcttt acataccatg aataaaacat taaaacatca tatggagaag 3240 

tttacatggt gattgttcac ctgcagtact gtggagtttt aacattttgt cctcttttca 3300 

gtgaaacaga gtaaaaatat tcatctacca ttactgttat ttgctgattt tgttttattt 3360 
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tttgatggta 


atattctatc 


cttatgacac 


tattgcaacc 


aaattggctt 


taccatcttg 


3420 


gctttagtag gtatagaaga 


caatggatta 


ccatctttat 


tgctgtaatg 


tgttaagcat 


3480 


tatatgctag tagaatctag 


tttaattgtt 


tcaggtggaa 


agtattcttt 


gagtttccat 


3540 


attgaatgtg 


tttggactaa 


acaaacaata




aactactgat


gtctgcagca


3590


<210> 21 

<211> 3200 

<212> DNA 

<213> Homo sapien 












<400> 21 
aggctgtcta 


gaggatccct 


cgggccacca 


agacaggcag 


aggagatatg 


gaacagaatg 


60 


actctgacaa 


ccgtagaata 


attcgctatc 


cagatagtca 


tcaacttttt 


gttggtaact 


120 


tgccacatga 


tattgatgaa 


aatgagctaa 


aggaattctt 


catgagtttt


ggaaacgttg 


180 


tggaacttcg 


catcaatacc 


aagggtgttg 


ggggaaagct 


tccaaatttt 


ggttttgtgg 


240 


tttttgatga 


ctctgaacca 


gttcagagaa 


tcttaattgc 


aaaaccgatt 


atgtttcgag 


300 


gggaagtacg 


tttaaatgtg 


gaagagaaaa 


aaacaagagc 


tgcaagagag 


cgagaaacca 


360 


gaggtggtgg 


tgatgatcgc 


agggatatta 


ggcgcaatga 


tcgaggtccc 


ggtggtccac 


420 


gtggaattgt 


gggtggtgga 


atgatgcgtg 


atcgtgatgg 


aagaggacct 


cctccaaggg 


480 


gtggcatggc 


acagaaactt 


ggctctggaa 


gaggaaccgg 


gcaaatggag 


ggccgcttca 


540 


caggacagcg 


tcgctgaagc 


tccactgttg 


gcaaagtctt 


ggcagtggta 


cattattcat 


600 


cgtgtttgca 


ttcttgttaa 


tttttttttt 


ggctttggaa 


tgtgacacag 


cctttttgat 


660 


catttctttg 


atgtgaaaag 


catctttggt 


tatcagttaa 


attgaggtgg 


acattatttc 


720 


cccaatttca 


caacaggatt 


cacattgtta 


atttataaat 


ctagacttgg 


agaattaagg 


780 


actgagaaat 


gaccatatct 


taaactatct 


acgacaaagt 


gaacttaaaa 


ggacatgccc 


840 


actgaattca 


ggtcctttga 


gtaaaaaaaa 


aatcttctgc 


tgcacatttt 


gtttaagtgt 


900 


tactgtttct 


gcctgttaat 


gctgggaaca 


caaatagtgc 


aatttgtgca 


attggagaat 


960 


cttgcctttt 


ttcttggctc 


cccccaaaaa 


tacaaaccaa 


cagaaacttg 


ttatgcactc 


1020 


atcaaaatgt 


actaatgggt 


actctgaact 


cattaacatt 


gacatctgca 


acaggaggca 


1080 


acagggaaaa 


aatctcatct 


tcttttccag 


tagaaaatag 


tttgtgaaat 


gatgagggca 


1140 


ttttatctgc 


ttgctgtgac 


cagcgtgtgt 


acacataaac 


cttaacaaga 


ctacaagtat 


1200 


attccagaag 


gaaatcattt 


tagttatgaa 


ctaaataata 


aaaattagaa 


cttcaaatgc 


1260 


gatggtcttg 


actattagac 


cagatttagt 


agctccatat 


ctaagatttt 


tctacctgcc 


1320 


cctcttcagt 


acagggatgg 


ctggctgctc 


aacacactcc 


tcctcccctt 


ttttcctttc 


1380 
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tttaagctgt 


gtacagtgaa 


aattgtcttt 


actgtatttt 


tgttctctgg 


taatgtaata 


1440 


agcatgatgg 


tgccttctat 


taatacatca 


ttccagtctt 


gctggtaatt 


ttgtacagta 


1500 


tagtgtatga attgctgtgc 


tgcaaagcca 


aacagctgca 


aaatgttgaa 


aaatcatcga 


1560 


aatgtataaa 


aattgcagta 


tctttaaaat 


cagtaaaatg gactagcata 


ttatttatct 


1620 


tgttcttcag ttaacaactt 


tgtgttctct gtgggaggga gggagtcctg tgtgtttgtg 


1680 


gggagaggga 


aggaggaagt 


cagttatttg agtaagcctc 


tagttgactt 


ttctcttagc 


1740 


ctgaatgtgg acgttgaaac 


atatcacttc agggcttgga 


aaagtcagtc 


aacttgacgt 


1800 


acatttttag 


tgacatttta 


aaagcagtca gattctataa 


atggcaagta 


agcctgaagt 


1860 


gaggatactg 


caattttcgg 


agaaaagaac 


agcagctctt 


taagtgtttg 


cattttctat 


1920 


ttggggggca 


gggaactgtc 


attcattttg 


cacaattctt 


gaactgatgt 


cagcacccga 


1980 


gtggctcctg aatttaagtc 


tgggacgaca 


tcttttattt 


ttacatgaat 


ctttaaacaa 


2040 


ttctgtgagc aaagtttgta gctgctggat 


tattgtctgt 


ctttatagca agttccagta 


2100 


aaccacaagt 


atggcaaagc 


ttatccaatt 


ttatgcttgg agcagtcagt 


acataccagt 


2160 


ttctgatgtt 


tcaggcagga 


gtggggtaaa 


taagtgtgac 


cacttaaagc 


tgctcgttag 


2220 


catggaagac 


ttctccattc 


tatctttgta 


aaacagacaa 


gatatgcact 


tgacatagta 


2280 


gcaaattggt 


tctgaattat 


gcaactgttt 


gctatttagt 


aaactagcaa atgatgcatg 


2340 


tattttgttt 


ttcatgtact 


gggcaatatg 


agtaaaatct 


gtcccttttt 


ccccctttga 


2400 


atgaggtctt 


ccatgtttga 


gggaaagtct 


tgcactattg 


catatatttt 


ggggacacag 


2460 


attttcatag 


tttccatttt 


tggggggctt 


aaggattttt 


tttttttctg 


tttgaaacag 


2520 


ttttatactt 


tctgatatag 


tacttgaaat 


tcttaccaga 


aaattacttt 


ggagttttga 


2580 


agcctttatt 


aatactactt 


ttaaagaagc 


agttgtttta 


ttgtcaatgt 


tttttttccc 


2640 


ccaagcatat 


tttcttgtat 


ttctgtttcc atatatatat 


atatatataa 


atttccaatt 


2700 


caggatattg 


ccctgccatc 


catgaaaact 


gttctggcac 


caaaagtaat 


gacaaatgtt 


2760 


aagtgtaata atagaaaagt 


agagcaaaga gccattcagc 


ttcagtcttt 


acataccatg 


2820 


aataaaacat 


taaaacatca 


tatggagaag 


tttacatggt 


gattgttcac 


ctgcagtact 


2880 


gtggagtttt 


aacattttgt 


cctcttttca 


gtgaaacaga 


gtaaaaatat 


tcatctacca 


2940 


ttactgttat 


ttgctgattt 


tgttttattt 


tttgatggta 


atattctatc 


cttatgacac 


3000 


tattgcaacc aaattggctt 


taccatcttg gctttagtag gtatagaaga 


caatggatta 


3060 


ccatctttat 


tgctgtaatg 


tgttaagcat 


tatatgctag 


tagaatctag 


tttaattgtt 


3120 


tcaggtggaa 


agtattcttt 


gagtttccat 


attgaatgtg 


tttggactaa 


acaaacaata 


3180 
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aactactgat gtctgcagca 3200 

<210> 22 

<211> 627 

<212> DNA 

<213> Homo sapien 

<400> 22 

ggtggctccc gagtaggctc agagcgaccg tggccgggcg gaagcggctt tctggcctaa 60 

gctttaacag gcttcgcctg tgcttcctgt ttcctctttt accaaggacc cgccaacatg 120 

ggccgcgttc gcaccaaaac cgtgaagaag gcggcccggg tcatcataga aaagtactac 180 

acgcgcctgg gcaacgactt ccacacgaac aagcgcgtgt gcgaggagat cgccattatc 240

cccagcaaaa agctccgcaa caagatagca ggttacgtca cgcatctgat gaagcgaatt 300

cagagaggcc cagtaagagg tatctccatc aagctgcagg aggaggagag agaaaggaga 360 

gacaattatg ttcctgaggt ctcagccttg gatcaggaga ttattgaagt agatcctgac 420

actaaggaaa tgctgaagct tttggasttc ggcagtctgt scaaccttca ggtcattcat 480 

cccaactgta ggctgagtga cctgaaggtt ggacagactg cagttgggat gaatttcaaa 540 

acgcctcggg gacctgtttg aattttttct gtagtgctgt attattttca ataaatctgg 600 

gacaacagca aaaaaaaaaa aaaaaaa 627 

<210> 23 

<211> 3187 

<212> DNA 

<213> Homo sapien 

<400> 23 

ttgctcgagc ggccggcgcc agtgtgctgg aaaggcgcgg gcttcgttcg taaggaaggg 60 

ggcctaggcc cgggcctgcg gtggtggggg ttgctgcgcg ccgggggtcg ctcctgctgt 120 

gtcttccgct ccagcttcgc ccacttcccc ttgccagcgg ggtgggcgcg gagaagacct 180 

gccggagcca tggaggacga agtggtccgc tttgccaaga agatggacaa gatggtgcag 240 

aagaagaagg tgagcgcggc gagcggcgcg ggccgggagg gaggccgggc cgcggtccgg 300 

cccgaggggg ccgtcccctc cctccttctc cctcccctcc ccctgcagcc tcccgtctgg 360 

gagccgggcc cgccgcgccc gcgtccccgg ggagggtgtc cgggcccggg gcgtcgggtc 420 

ccgcggcaac ctgcgttagg acccgctcgc tcttgtattt ccagtccctg ctcgctggtg 480 

ccgtgtaata aacctaatcg atttagggat tagggcacgc tctccgttga ctttaaactt 540

tcgtgctagt aaaaccttgg catttagctt aggaattgma gcgtagaaag tcctcagaac 600 

ttgaaagaat ggagtcttcc gtgatccgtg gccaacatct agaaagagca tcaaatagag 660 

gcttgtgatc ggtattagac atcgggctgg taaaatgtaa aacttctgtg tcttattttt 720 
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gaagagggct 


gaacgtccgc 


ctggttatct 


caactgtgct 


acatgtttgc 


ttagcgatta 


780 


aaaacgcggc 


tggagcattg 


gatttgctaa 


aggagcttaa 


gaatattcct 


atgaccctgg 


840 


aattactgca 


gtccacaaga 


atcggaatgt 


cagttaatgc 


tattcgcaag 


cagagtacag 


900 


atgaggaagt 


tacatctttg 


gcaaagtctc 


tcatcaaatc 


ctggaaaaaa 


ttattagatg 


960 


ggccatcaac 


tgagaaagac 


cttgacgaaa 


agaagaaaga 


acctgcaatt 


acatcgcaga 


1020 


acagccctga 


ggcaagagaa 


gaaagtactt 


ccagcggcaa 


tgtaagcaac 


agaaaggatg 


1080 


agacaaatgc 


tcgagatact 


tatgtttcat 


cctttcctcg 


ggcaccaagc 


acttctgatt 


1140 


ctgtgcggtt 


gaagtgtagg 


gagatgcttg 


ctgcagctct 


tcgaacaggg 


gatgactaca 


1200 


ttgcaattgg 


agctgatgag 


gaagaattag 


gatctcaaat 


tgaagaagct 


atatatcaag 


1260 


aaataaggaa 


tacagacatg 


aaatacaaaa 


atagagtacg 


aagtaggata 


tcaaatctta 


1320 


aagatgcaaa 


aaatccaaat 


ttaaggaaaa 


atgtcctctg 


tgggaatatt 


cctcctgact 


1380 


tatttgctag 


aatgacagca 


gaggaaatgg 


ctagtgatga 


gctgaaagag 


atgcggaaaa 


1440 


acttgaccaa 


agaagccatc 


agagagcatc 


agatggccaa 


gactggtggg 


acccagactg 


1500 


acttgttcac 


atgtggcaaa 


tgtaaaaaga 


agaattgcac 


ttacacacag 


gtacaaaccc 


1560 


gtagtgctga 


tgaaccaatg 


acaacatttg 


ttgtctgtaa 


tgaatgtgga 


aatcgatgga 


1620 


agttctgttg 


agttggaaga 


attggcaaaa 


tatctggacc 


attaagaaaa 


cggattttgt 


1680 


aactagcttt 


aaactaggcc 


aagcaactag 


ttttcctgca 


aatcaaattt 


ttaaagcaac 


1740 


ttgggttaga 


ctttgttttt 


gacctaacat 


cccttcctta 


aatgccttct 


gtagtttcag 


1800 


atcagtaggg 


agaccatata 


ataattttat 


ggtacctgtt 


tcaaaacata 


ttttttctgt 


1860 


ttttataagt 


aagttgatat 


taattaaact 


cttggcaata 


tttcttcttt 


cttaaaggaa 


1920 


aatatacctt 


aacttttttt 


cttttacact 


gtgaaacata 


cacagtagaa 


attctgttac 


1980 


tctctgttat 


taatacataa 


atgaaaatac 


atttttttcc 


atattggcat 


gtagctacaa 


2040 


atattaaagg 


aggagaaaag 


gtaatataat 


tttaggttta 


ccaaatatgg 


tgtgtattca 


2100 


aataatactt 


gaccagctta 


tctaaaatgt 


acataatttt 


gaggtagctt 


atgaatttga 


2160 


ttttaattat 


tatgttcaca 


agcttggaat 


attagatatt 


attttgcatc 


tgtaactaac 


2220 


cgtgatcatc 


atttcttgta 


atttcttgta 


catgtatatt 


acttgttctt 


aatagatttt 


2280 


tggaaacaag 


actttattga 


gatcagtttg 


gttttcctgt 


taatttacct 


gtttgacttt 


2340 


ataatgtgtt 


ttagttttgc 


agaagaacac 


tgttgtagtt 


tagaaggctt 


ttcataaatc 


2400 


ccctcatagg 


caaagatgaa 


aacttcccac 


tatttttttc 


ccctcttagg 


aagacatact 


2460 


ggaaagaaaa 


tgtttagcat 


cttagtgtag 


tatagctatt 


gtaaacagtt 


catgactaga 


2520 
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ttttgattcg 


gaaatctata 


ctgaccaagg 


attaatctta 


aggattgtat 


aattcattaa 


2580 


agctgtggtc 


tttccatgtg 


gagactgata 


gaaaataatt 


ttgtcccaag 


tcttatttgc 


2640 


tgactttttc 


tgtcatgagt 


gagattgttg 


aacaaactga 


atatatgggc 


tatagcaagt 


2700 


agctttacag 


tacagatctt 


acaattaagt 


tttgcttttg 


ttaaagtgtg 


taccattttt 


2760 


tctgtttgga 


gtaagacaaa 


aattgttttg 


acataggttc 


cctagggtac 


acttgctcta 


2820 


gcatacttta 


aaggccactg 


ttgcaaagtc 


tacattttat 


gctgaatctg 


cattctgtca 


2880 


ggcacccgta 


gaaagacctc 


agtacatgct 


ttgcactctc 


ctttgctccc 


tttttccaat 


2940 


ttcttattgc 


atatcatttt 


gttgtaatac 


agaaagcagc 


atttttaaat 


gtccgtgtta 


3000 


agaattggcc 


cactggtacc 


aactcacctc 


tattttgtca 


gttcatagtt 


gaagattttg 


3060 


ttttatttca 


aaaacaaagt 


acatttttga 


aataatgttt 


cagaataaaa 


taatctcact 


3120 


tttaagtgat 


ccattttaaa 


atttgtaatt 


caataaagtt 


ttttttgttg 


ttaaacataa 


3180 


aaaaaaa 












3187 


<210> 24 
<211> 2990 
<212> DNA 
<213> Homo sapien 












<400> 24 
cccggcattc 


cgtgctcctt 


ggttccggcg 


ttggagctct 


ttggggccca 


gctttgcgga 


60 


cccgggagct 


cgggacgcag 


gcggggcttg 


tgctccgcgg 


gggcagggcg 


tagggtgggc 


120 


ctcctacctc 


ccctgatctc 


gcggtttgtt 


ccgtttcatt 


ggagcttccc 


ggaccgtgtg 


180 


ctcgacggtg 


ccctaggtgc 


cgtggggcca 


cacgcgagtc 


tgataagcac 


cctcccccgg 


240 


aatcatgcgg 


tgctgtgagg 


cctagcgaag 


atgaagatag 


aatgcaaggt 


agaaagtgct 


300 


ggataccttt 


agaaagctgc 


aggactggtg 


cgatgggagt 


tgagacgtaa 


gaacctgccc 


360 


gtccgtaggg 


ctctggatgc 


tgctgaggcc 


cgaggcccct 


atggcagatt 


tgaaaattca 


420 


cccttgtaga 


gtcattcctg 


cctttgagcg 


gactcccttt 


taagagatct 


caagagagcg 


480 


ttcggtggag 


gccctgggtc 


tgcacagctc 


acctccctgg 


gaactgctcg 


cccgagcgtc 


540 


ggagccggcg 


ctggccccct 


gcagccggaa 


ggttgcagcc 


gcaggagccc 


cggaggccca 


600 


ggacacaggg 


ctcttgctct 


tgcagaatcc 


acaggtcttt 


cttgaggaaa 


tctgtagaca 


660 


gaactttgtg 


ctgcgttttt 


atctagggaa 


ggaacagaag 


agtgtcgtct 


cctagaaatc 


720 


tagcactgga 


gaaacgagga 


aaattcttcc 


agcgatggtc 


tcccactcag 


agctgaggaa 


780 


gcttttctac 


tcagcagatg 


ctgtgtgttt 


tgatgttgac 


agcacggtca 


tcagagaaga 


840 


aggaatcgat 


gagctagcca 


aaatctgtgg 


cgttgaggac 


gcggtgtcag 


aaatgacacg 


900 
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ggcggggcag 


4" rtr«o 4- 4* 4- a a 

LyccuLLCaa 


agctgctctc 


acagagcgct


tagccctcat 


960 


<T"«a. CI C 4- 


aggg ag cagg 


cgcagagacc 


catagcagag 


caacccccac 


acctgacccc 


1020 




yaycugyCaa 


gucgccuaca 


ggagcgaaat 


gttcaggttt 


tcctaatatc 


1080 




a rtrra fr 4- a f* f* rr 

cty y ciy uau 


4~ a rraAMa 4~ 4- 

LayayCaLyL 


tgcttcaaag


ctcaatatcc 


cagcaaccaa 


1140 


f- Cft" af rr/-» 


aaLayyCLya 


SOKI" 4* y-« 4— 4» 

ddtCCCaCCC 


taacggtgaa 


tatgcaggtt


ttgatgagac 


1200 


CTf~!a CI tT'a spa 


y ULyactLL Ly 


y tyyu a. ci ayy 


aaaagtgatt 


aaacttttaa 


aggaaaaatt 


1260 


tcattthaao 


3 B n n t* *n r^aa 
aaaauaaL.L.a 


ugaueggaga 


tggtgccaca 


gatatggaag 


cctgtcctcc 


1320 


t" err* f" era f* i-t/"« f* 


4" 4» ^» a 4» §■ ST/T -a 4~ 

tCCaCCyyat 


ttggaggaaa 


tgtgatcagg


caacaagtca aggataacgc


1380 




4- /~»ra /"*• t~ rrs» t~ 
aUuaCLyaL C 


LLy Cdyay Cl 


gctgggagaa 


ctggaagaat 


aacatccatt 


1440 




CCCaaaCdaC 


ttcagatgaa


tctttacaag 


ttatacagat 


tgatactgtt 


1500 


t~ err* t~ t~ a narrf 
uy v< »— Lauoy L. 


tycctaCiac 


aaccLgctac 


agaaagttgg


tacaaatgat 


ctgtacttta 


1560 




a fTf*ra a fr* ^/**4- a 
ayyaot CCCa 


gaagatcgcL 


4-I-4-4-4-4-+-4-4-4- 


ttttaactgt 


agttccagta 


1620 


ttatah oa t~ or 


au La t. uyaut 


ucceggagag 


^v4- 4>4-4-4>4»4»4>4» 

gcttcttttt 


tttttgagac 


agaatcttgc 


1680 


UtL.y L uytLC 


aggctggagt 


gcagtggcgc 


ggtctcggct


cactgcaagc 


tctgcctccc 


1740 


a ^in 4- 4- /-« ^ /— * j— • 

ayy t CCaCy C 


catccLCCtg 


cctcagcctc 


ccgagtagct 


gggactacag gcgcccgcca 


1800 




CLdaLLLLLL 


gtatttttag 


tagagacggg


gtttgaccgt 


gttagccagg


1860 




^- ^ f% ^ x*t ^ y-i 

cccccugacc 


ttgtgatccg 


cctgcctcag 


cctcccaaag 


tgctgggatt 


1920 


B v»ayy^L L-yy 


y L>L>aL.L>y oyc 


ccagccaaug 


tcctagagag 


ttttgtgatc 


tgaattcttt 


1980 


nt-art-at-at-t-t- 
auy LctLct L u u 


gLdyCLdtaL 


ttcatacaaa


gtgctttaag 


tgtggagagt 


caattaaaca 


2040 


LLltlaCLCt 


tagaaatacg


gattcggcag


ccttcagtga 


atattggttt 


ctctttggta 


2100 


Ly I t_-ctd L-cLclci 


aytLtatccg 


tatgtcagaa 


cggatttgtg 


gaattttcag 


ttctgcctct 


2160 


f- cf/-ts» t* a t~ t* r?rrr 
L.yL»aLaL uyy 


aa a 4™ a z™' ^» a a 4* 


atttcaggta


ccgtgttgaa 


gcataaatag 


cccccaggca 


2220 


aaaaacaaar 


t~ t" a a rra +* r* t* 
L. LaayaL LLL 


ant*4 , f ^ys*i 4~ 4» * 

dctLcgccca 


gtgctgggta 


caatgtcctc 


accccatgag 


2280 


CIV— l_ ^ 1^. d 


4~ 4— « f» 4* /ta a r^r 

cc. LCCbyaay 


cccugggacc 


tggaaagtcc 


ttttttttac 


ttcttgggat 


2340 


fcfccjt" C*ri t* 3 t~ a 

^ y L^aua La 


4- craa a a t* r^r 4- 4™ a 
LydddLy LLa 


caaaagccCL 


ccgttttttg 


ttttcctcca 


atataacaat 


2400 


arja erf r*t~ t- a a 


ar>a a a 3 ^r4- a ^ 

aCoaaayLaC 


cgaLtaaaaa 


tggatatttg 


ggctcaggcc 


tgtaatctca 


2460 


gcactgtggg 


aggacaagtt 


aagaggatca 


cttgaggaca 


ggaattcgag accagcctgg 


2520 


acaacagagc 


aagaccctgt 


ctctgcaaaa


aataaaaaca 


ttagctggat 


gtggcggcac 


2580 


gcacctgtag 


tcccagctac 


ttggtgggct 


gagatgggag 


gatcccttga 


actcagtagt 


2640 


tcaaggctgc


agtgagctgt 


gactctgcca 


ttgtactcca 


gcctgggtga 


cagtgagacc 


2700 


ccgtctctta 


aaaaaaaaaa 


aaaaaaaaaa 


gatttttgca


aaatccctta 


aagttaaaag 


2760 
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aattaaaaac aggttttact aattcacaag ggccagcatt ttcccctaaa tgaattaaat 2820 

tttaataact cctaagaggc tattataaac catttctttt ttataaatgc cttggaaata 2880

ctccgaaagc caggtttatg tttgttttga gcagagccaa gcaagggaaa tttcagaaca 2940 

tgctttgaat gtgtgagatg ctataccctc cctggaggaa aaatgggggg 2990 

<210> 25 

<211> 3153 

<212> DNA 

<213> Homo sapien 

<400> 25 

cccggcattc cgtgctcctt ggttccggcg ttggagctct ttggggccca gctttgcgga 60 

cccgggagct cgggacgcag gcggggcttg tgctccgcgg gggcagggcg tagggtgggc 120 

ctcctacctc ccctgatctc gcggtttgtt ccgtttcatt ggagcttccc ggaccgtgtg 180 

ctcgacggtg ccctaggtgc cgtggggcca cacgcgagtc tgataagcac cctcccccgg 240 

aatcatgcgg tgctgtgagg cctagcgaag atgaagatag aatgcaaggt agaaagtgct 300

ggataccttt agaaagctgc aggactggtg cgatgggagt tgagacgtaa gaacctgccc 360 

gtccgtaggg ctctggatgc tgctgaggcc cgaggcccct atggcagatt tgaaaattca 420 

cccttgtaga gtcattcctg cctttgagcg gactcccttt taagtttaca gaagcacttg 480 

cagaactcat cagaagccac cccgcttatc agcagatctc aagagagcgt tcggtggagg 540 

ccctgggtct gcacagctca cctccctggg aactgctcgc ccgagcgtcg gagccggcgc 600 

tggccccctg cagccggaag gttgcagccg caggagcccc ggaggcccag gacacaggac 660 

agagtctcgt tctgtcgccc aggctggagt gcagtggtgt gatctcggct cacttcaacc 720 

tccgcctaca gggttcaagg gattctcctg cctcagcctc ctgagtatct gggctcttgc 780 

tcttgcagaa tccacaggtc tttcttgagg aaatctgtag acagaacttt gtgctgcgtt 840 

tttatctagg gaaggaacag aagagtgtcg tctcctagaa atctagcact ggagaaacga 900 

ggaaaattct tccagcgatg gtctcccact cagagctgag gaagcttttc tactcagcag 960 

atgctgtgtg ttttgatgtt gacagcacgg tcatcagaga agaaggaatc gatgagctag 1020

ccaaaatctg tggcgttgag gacgcggtgt cagaaatgac acggcgagcc atgggcgggg 1080

cagtgccttt caaagctgct ctcacagagc gcttagccct catccagccc tccagggagc 1140

aggtgcagag actcatagca gagcaacccc cacacctgac ccccggcata agggagctgg 1200

taagtcgcct acaggagcga aatgttcagg ttttcctaat atctggtggc tttaggagta 1260

ttgtagagca tgttgcttca aagctcaata tcccagcaac caatgtattt gccaataggc 1320

tgaaattcta ctttaacggt gaatatgcag gttttgatga gacgcagcca acagctgaat 1380
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ctggtggaaa 


aggaaaagtg 


attaaacttt 


taaaggaaaa 


atttcatttt 


aagaaaataa 


1440 


tcatgattgg 


agatggtgcc 


acagatatgg 


aagcctgtcc 


tcctgctgat 


gctttcattg 


1500 


gatttggagg 


aaatgtgatc 


aggcaacaag 


tcaaggataa 


cgccaaatgg 


tatatcactg 


1560 


attttgtaga 


gctgctggga 


gaactggaag 


aataacatcc 


attgtcgtac 


agctccaaac 


1620 


aacttcagat 


gaatttttac 


aagttataca 


gattgatact 


gtttgcttac 


agttgcctat 


1680 


tacaacttgc 


tatagaaagt 


tggtacaaat 


gatctgtact 


ttaaactaca 


gttaggaatc 


1740 


ctagaagatt 


gctttttttt 


tttttttaac 


tgtagttcca 


gtattatatg 


atgactattg 


1800 


atttcctgga 


gaggtttttt 


ttttttttga 


gacagaatct 


tgctctgttg 


cccaggctgg 


1860 


agtgcagtgg 


cgcggtctcg 


gctcactgca 


agctctgcct 


cccaggttca 


cgccattctc 


1920 


ctgcctcagc 


ctcccgagta 


gctgggacta 


caggcgcccg 


ccaccacatc 


cggctaattt 


1980 


tttgtatttt 


tagtagagac 


ggggtttgac 


cgtgttagcc 


aggatggtct 


tgatctcctg 


2040 


accttgtgat 


ccgcctgcct 


cagcctccca 


aagtgctggg 


attacaggct 


tgggccaccg 


2100 


cgcccagcca 


atgtcctaga 


gagttttgtg 


atctgaattc 


tttatgtata 


tttgtagcta 


2160 


tatttcatac 


aaagtgcttt 


aagtgtggag 


agtcaattaa 


acacctttac 


tcttagaaat 


2220 


acggattcgg 


cagccttcag 


tgaatattgg 


tttctctttg 


gtatgtcaat 


aaaagtttat 


2280 


ccgtatgtca 


gaacggattt 


gtggaatttt 


cagttctgcc 


tcttgcatat 


tggaataccc 


2340 


aatatttcag 


gtaccgtgtt 


gaagcataaa 


tagcccccag 


gcaagaaaca 


ggcttaagac 


2400 


tctactttgc 


ttagtgctgg 


gtacaatgtc 


ctcaccccat 


gagcccactc 


ccacttcctg 


2460 


aagccctggg 


acctggaaag 


tccttttttt 


tacttcttgg 


gatttgtcat 


atatgaaatg 


2520 


ttacaaaagt 


tttttgtttt 


ttgttttcct 


ccaatataac 


aatagagtct 


taaacaaaag 


2580 


tactgattaa 


aaatggatat 


ttgggctcag 


gcctgtaatc 


tcagcactgt 


gggaggacaa 


2640 


gttaagagga 


tcacttgagg 


acaggaattc 


gagaccagcc 


tggacaacag 


agcaagaccc 


2700 


tgtctctgca 


aaaaataaaa 


acattagctg 


gatgtggcgg 


cacgcacctg 


tagtcccagc 


2760 


tacttggtgg 


gctgagatgg 


gaggatccct 


tgaactcagt 


agttcaaggc 


tgcagtgagc 


2820 


tgtgactctg 


ccattgtact 


ccagcctggg 


tgacagtgag 


accccgtctc 


ttaaaaaaaa 


2880 


aaaaaaaaaa 


aaagattttt 


gcaaaatccc 


ttaaagttaa 


aagaattaaa 


aacaggtttt 


2940 


actaattcac 


aagggccagc 


attttcccct 


aaatgaatta 


aattttaata 


actcctaaga 


3000 


ggctattata 


aaccatttct 


tttttataaa 


tgccttggaa 


atactccgaa 


agccaggttt 


3060 


atgtttgttt 


tgagcagagc 


caagcaaggg 


aaatttcaga 


acatgctttg 


aatgtgtgag 


3120 


atgctatacc 


ctccctggag 


gaaaaatggg 


ggg 






3153 
<210> 26
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<211> 2968 
<212> DNA 
<213> Homo sapien 




<400> 26 
cccggcattc cgtgctcctt ggttccggcg ttggagctct ttggggccca gctttgcgga 60

cccgggagct cgggacgcag gcggggcttg tgctccgcgg gggcagggcg tagggtgggc 120

ctcctacctc ccctgatctc gcggtttgtt ccgtttcatt ggagcttccc ggaccgtgtg 180

ctcgacggtg ccctaggtgc cgtggggcca cacgcgagtc tgataagcac cctcccccgg 240

aatcatgcgg tgctgtgagg cctagcgaag atgaagatag aatgcaaggt agaaagtgct 300

ggataccttt agaaagctgc aggactggtg cgatgggagt tgagacgtaa gaacctgccc 360

gtccgtaggg ctctggatgc tgctgaggcc cgaggcccct atggcagatt tgaaaattca 420

cccttgtaga gtcattcctg cctttgagcg gactcccttt taagtttaca gaagcacttg 480

cagaactcat cagaagccac cccgcttatc agcagatctc aagagagcgt tcggtggagg 540

ccctgggtct gcacagctca cctccctggg aactgctcgc ccgagcgtcg gagccggcgc 600

tggccccctg cagccggaag gttgcagccg caggagcccc ggaggcccag gacacaggat 660

ctagggaagg aacagaagag tgtcgtctcc tagaaatcta gcactggaga aacgaggaaa 720

attcttccag cgatggtctc ccactcagag ctgaggaagc ttttctactc agcagatgct 780

gtgtgttttg atgttgacag cacggtcatc agagaagaag gaatcgatga gctagccaaa 840

atctgtggcg ttgaggacgc ggtgtcagaa atgacacggc gagccatggg cggggcagtg 900

cctttcaaag ctgctctcac agagcgctta gccctcatcc agccctccag ggagcaggtg 960

cagagactca tagcagagca acccccacac ctgacccccg gcataaggga gctggtaagt 1020

cgcctacagg agcgaaatgt tcaggttttc ctaatatctg gtggctttag gagtattgta 1080

gagcatgttg cttcaaagct caatatccca gcaaccaatg tatttgccaa taggctgaaa 1140

ttctacttta acggtgaata tgcaggtttt gatgagacgc agccaacagc tgaatctggt 1200

ggaaaaggaa aagtgattaa acttttaaag gaaaaatttc attttaagaa aataatcatg 1260

attggagatg gtgccacaga tatggaagcc tgtcctcctg ctgatgcttt cattggattt 1320

ggaggaaatg tgatcaggca acaagtcaag gataacgcca aatggtatat cactgatttt 1380

gtagagctgc tgggagaact ggaagaataa catccattgt cgtacagctc caaacaactt 1440

cagatgaatt tttacaagtt atacagattg atactgtttg cttacagttg cctattacaa 1500

cttgctatag aaagttggta caaatgatct gtactttaaa ctacagttag gaatcctaga 1560

agattgcttt tttttttttt ttaactgtag ttccagtatt atatgatgac tattgatttc 1620
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ctggagaggt 


tttttttttt 


tttgagacag 


aatcttgctc 


tgttgcccag 


gctggagtgc 


1680 


agtggcgcgg 


tctcggctca 


ctgcaagctc 


tgcctcccag 


gttcacgcca 


ttctcctgcc 


1740 


tcagcctccc 


gagtagctgg 


gactacaggc 


gcccgccacc 


acatccggct 


aattttttgt 


1800 


atttttagta 


gagacggggt 


ttgaccgtgt 


tagccaggat 


ggtcttgatc 


tcctgacctt 


1860 


gtgatccgcc 


tgcctcagcc 


tcccaaagtg 


ctgggattac 


aggcttgggc 


caccgcgccc 


1920 


agccaatgtc 


ctagagagtt 


ttgtgatctg 


aattctttat 


gtatatttgt 


agctatattt 


1980 


catacaaagt 


gctttaagtg 


tggagagtca 


attaaacacc 


tttactctta 


gaaatacgga 


2040 


ttcggcagcc 


ttcagtgaat 


attggtttct 


ctttggtatg 


tcaataaaag 


tttatccgta 


2100 


tgtcagaacg 


gatttgtgga 


attttcagtt 


ctgcctcttg 


catattggaa 


tacccaatat 


2160 


ttcaggtacc 


gtgttgaagc 


ataaatagcc 


cccaggcaag 


aaacaggctt 


aagactctac 


2220 


tttgcttagt 


gctgggtaca 


atgtcctcac 


cccatgagcc 


cactcccact 


tcctgaagcc 


2280 


ctgggacctg 


gaaagtcctt 


ttttttactt 


cttgggattt 


gtcatatatg 


aaatgttaca 


2340 


aaagtttttt 


gttttttgtt 


ttcctccaat 


ataacaatag 


agtcttaaac 


aaaagtactg 


2400 


attaaaaatg 


gatatttggg 


ctcaggcctg 


taatctcagc 


actgtgggag 


gacaagttaa 


2460 


gaggatcact 


tgaggacagg 


aattcgagac 


cagcctggac 


aacagagcaa 


gaccctgtct 


2520 


ctgcaaaaaa 


taaaaacatt 


agctggatgt 


ggcggcacgc 


acctgtagtc 


ccagctactt 


2580 


ggtgggctga 


gatgggagga 


tcccttgaac 


tcagtagttc 


aaggctgcag 


tgagctgtga 


2640 


ctctgccatt 


gtactccagc 


ctgggtgaca 


gtgagacccc 


gtctcttaaa 


aaaaaaaaaa 


2700 


aaaaaaaaga 


tttttgcaaa 


atcccttaaa 


gttaaaagaa 


ttaaaaacag 


gttttactaa 


2760 


ttcacaaggg 


ccagcatttt 


cccctaaatg 


aattaaattt 


taataactcc 


taagaggcta 


2820 


ttataaacca 


tttctttttt 


ataaatgcct 


tggaaatact 


ccgaaagcca 


ggtttatgtt 


2880 


tgttttgagc 


agagccaagc 


aagggaaatt 


tcagaacatg 


ctttgaatgt 


gtgagatgct 


2940 


ataccctccc 


tggaggaaaa 


atgggggg 








2968 



<210> 27 

<211> 2664 

<212> DNA 

<213> Homo sapien 

<400> 27 

tatatcaact ttctgtgtca agatctttag gtgaagaaca gttttctgtg ggatctgtca 60 

acttcatgaa acagaccaat atccagaaaa atactaatac cagagataca agtaaaaaaa 120 

caaaagacca gctcatcatc gacgctggtc agaaacattt tggggctact gtgtgcaagt 180 



cttgtggtat gatatatact gcttccaacc ctgaagatga aatgcagcat gtacagcatc 240
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a. rpapanrrfr 1 f- 


c c c gga agga 


aucaaacacg 


tgggttggaa 


gaaagaacgt 


gtagtagcag 


300 


a ort* t" ^ t" or nria 


egggaaaa c c 


—.4- 1- — — - 4- 4- _ 
gcgccggccc 


tgccacatga 


tccaagcttt 


gctatcaaaa 


360 


a fTrrt" a na a or 2a 


tgcccaagaa 


cttgttgata 


atgaattggg


cttccagcaa 


gttgttccta 


420 


ctct LyLCCdaa 


caaaataaaa 


_,_4-4-4-4~^.4~^4_ 

acttttcctt 


ttatatctga 


tgaaaagaga 


gtagttgggt 


480 


oft* t~ f* n — > fr* fr* oro« 
yuL.LadLL.yc 


agaacccacc


aaacaggcat 


ttcgtgtcct 


gtctgaacca 


attggtccag 


540 


act l. UL-L-Uaay 


01 4* ^ 4" 1 a ^1 f*r — v 

CLCLacygaa 


tgtcctaggg


cfctggcaatg 


mm km ^m mm wi n X— J_ 

ttcagatgta 


ccagaacctg 


600 


cagtctgtgg


gataagtaga 


atctgggttt 


tcagactgaa 


gagaagaaag 


cgcattgcaa 


660 


gacgaccggc


tgataccctc


aggaattgct


tcatgtttgg 


ctgttttctc 


agcactgatg 


720 


aaat~aoro»at-fr~ 
dad UdyudLL 


ttctgaccca


acaccagatg 


^wmm — ^ ~— 4- — 4- 4_ 

gcaagttatt 


tgcaaccaag 


tactgcaaca


780 


CCCCtddLU U 


cctcgtatat


aattttaata 


gttaaagctg 


atttcagtta 


taaaggagtt 


840 


— i /--j ^ --a 4* ^* 

aCuatCuyya 


taagttcaaa


gagctcctta 


ttataaaata 


caaactattt 


aatatcaaaa 


900 


LddddddtdC 


cgagactcac 


actcatacac 


acacacacac 


acacacacac 


acacacatat 


960 


CclCcty t. u.yty 


ttccttatga


gttgaaaagt


caggaataaa 


tttgttgaaa 


attatctggg 


1020 


Of O ^ 4^ ^•la a 0 o»or 

gduuCaadyy 


aaaaatcttt


ggggtgattc 


cctgattagc 


actctgaatg 


tttaattatg 


1080 


dddCULuyUd 


gctataactg


gaaaattacc 


tgactctttg 


taagagtatt 


aaatacaaag 


1140 


4-Ora» 4- 4- 4- 4- 4- n 4- 

cyaLCCt net 


ctagaaatgt 


gacctggtct 


tttataaagc 


ccactcttag 


accaggatta 


1200 


tctaatgcca


catcagaagc 


aaacaggcaa 


atttaaactt 


gggcaagtaa 


tttctgtgcc 


1260 


caacctgtaa 


agggaatfcee 


tgaatttttt 


ttttttttaa 


tagaggcatg 


ggtctcactg 


1320 


cgt, cgcccag 


gctggtctga 


aacttttggg 


etcaagegat 


cctcccaaaa 


cgctgggatt 


1380 


ar , anr , f"at7 ora 
dCdytCotyo 


yccacCy CyC 


ccagcctaat 


tcctgacttc 


tctatacaga 


gtcttcactt 


1440 


or a 4^ s n 4* 

gacagycacc 


cgtctgtagt 


aactcagttt 


gaatatcttt 


agaaaatgtt 


tagaatttat 


1500 


4~ 4~ fr* s 0 ^ ^ ^-?r 

ttyCaaCady 


atggtaagga 


ataagattat 


cccatatgca 


tttctgtaga 


gcagaatttg 


1560 


dLayCCLdyt 


gttcaatctt


4* 4* Mm _ri_L j-i. _ 

tttgaaaata 


aatgtttacc 


tgtcatcaga 


tttaattaaa 


1620 


attatactta gtaattgcac tattacttag ttaatttttg ttgtatggaa atattggtag 1680


4" a 4* a 4» 4» 

tdCLdCtuLg 


ggaacctgtt 


actgacaatt 


gatgtcatta 


acaaaatgee 


tagttggatt 


1740 


a ora V of 4- 4- 4- 4- /■» 

ay a ty it ucc 


attttccaat 


4. x_ 4. A- — « 4_ 4_ 

tttttgcttg 


tttaaaatgc 


accttacttg 


ttctgagata 


1800 


cctggcaaaa gtctttacaa aatgtatggt aatagaacca aggttagtaa atatacatag 1860

gctggtggat gagagaccat ggaactgtgt aaatacactt aaatgttcac acatttttct 1920

agtgtaattc ttggatactt taaaaagcaa aacattgttc aaattgtttt gattctgaaa 1980

aatcattcaa ctgctaactg gcaataagac tctaggcaag tcgttttcca gattgtaatt 2040

atatgtagaa actattcatc tgcattcatt ttatttgcct gtaagttaac atgtttccaa 2100

aatttaaaag cctgggtccc caaaagaatg tggaagtatt aaaatgtatg taattatgca 2160

aacattttaa tgctattttc tgcacttatt tcttttaaat attttattta aaatttttaa 2220

ttaacatttt gtttgcttaa tgcttttgtt atgaatcaat taaaattctt tattttatac 2280

aactaaatct gattttaatt attttattat gaaaatagta tatgttaaat aaaattcaga 2340

taatacagaa atagagattg ccaaaagaag aggcttcccg gccgggcgcg gtggctcacg 2400

cctgtaatcc cagcactttg ggaggccgag gcgggtggat catgaggtca ggagatcgag 2460

accatcctgg ctaacaaggt gaaaccccgt ctctactaaa aatacaaaaa attagccggg 2520

cgcggtggcg ggcgcctgta gtcccagcta ctcgggaggc tgaggcagga gaatggcgtg 2580

aacccgggag gcggagcttg cagtgagccg agattgcgcc actgcagtcc gcagtccggc 2640

ctgggcgaca gagcgagact ccgt 2664


<210> 28 

<211> 1757 

<212> DNA 

<213> Homo sapien 










<400> 28 
ctagcatcat aggtcaaaga cgttattaag tatcgactct gaaaaagcaa ctataaatac 60

tgcaaatcga ctaagccaga ggacgcaaat atatcctgcg tgttgtgtgc tggactgtgg 120

ttgagacttc atggtcgtgc tctgggctca tgggggcagc tgactgctca taattaagat 180

tgctgttgcc gctgcctgtg ttccaaagag aacacaagag gacagagcta aggtgtagga 240

atgcggaaga acttgcagaa actagagcta gtcttcggat tgaaatgaat taccaataaa 300

gggttgggaa atggatcaag gagagggaag cgttttgatg gtccctgctc cgaggttcag 360

cccatttgca gttgctgata accttgtcag gtcaccagat aaaggcaagt cttgtcaccc 420

tcaggcacct ggctgcgaag taataaaacg gttgccttat catctaaagg tggagtgagt 480

caaacctcag cccagggtaa ttatagtaaa ctgaaattgc ctgtccttta caaaaaaaaa 540

ggcggaggga gagagaaagg aaaagggctt aaggcaggaa attgtgtttg tctttctcct 600

tggatttaaa aaagaaacga gcaaaacagc aaaaagtttt ccagggagtg agaaaagcag 660

cccggctaga ctcccctcac cctgggttaa caggattgcc gacagatcag gcccacgtga 720

tggtcaagaa aagaggctca agctgatgaa aatccaagaa atcttcttag gagccttcct 780

ttctaaatgt aaatgactga gtcattgctt tgctaaggag ggaaggaagt tacaatcaca 840

ccaagtgact cattcattgc tgttagctcc ggcagttgga tgaaaagttt tagcaaaagc 900

agcaacaaat gtactttaaa tactagcaca gtaagggaat tcctgtcttt caggatgaat 960

gccatccaca caaaagagct cctgttgaca tctcatttac aatctccccc aggacacaga 1020

caagaccctt tcaataaatc cagctcagaa acacctattg ttcaaaatct tcaactcgcc 1080

acaggctacc accactccct atggctttgc aaaattaaag atttagaaga aggatggggc 1140

ggtggaagct atgaaaagag gcaggaaaaa agttcctttg acccgatgct gtcagagagc 1200

gtgcatgaag aagaaagtta atggtatttc catttatata agaagcgcct aagaaatgcc 1260

tgtgacgttc gtgaactagt gattgtgaat tccaaatttg atgccaactt tatgtgtaaa 1320

gaagctaact cctgccaaca tcgtggctga atgaacagct gggactatgc ttaacccatt 1380

cccagcttat aaaagcccca tggcagctgc agtgaagcat cagaaaagta tagtaagaag 1440

aaactgaatt tgaagtggat tcttacaaag gaaaaagaaa atcactattg taactatacc 1500

aaattactat gtatgtgagc tcccagcttt gacatcaagt cgttacagcc acgtaatggg 1560

attacgggtg atttttactt tattctttgg gcttctctga atttfccctat attttgaaaa 1620

agcacctgca ttattttgta attataaaaa agtataaata agaattatac acactattat 1680

ttcttcactg ctatgacatg actagattac aatttttttc tttttgagat gactttcggt 1740

cttgtttccc aggctgg 1757


<210> 29 

<211> 1299 

<212> DNA 

<213> Homo sapien 












<400> 29 
caggacacct tgggccagct tggttttact ttagatttca ctgttgtccc accccacttc 60

ttccttcacc aacatgcaag ttctttcctt ccctgccagc cagatagaca gatgggagag 120

gcagtcgcgg ccttcgttgt cagtagttct ttgatgtgaa aggggcagca cagtcattta 180

aacttgatcc aacctctttg catcttacag agttaaacag ctaaaagaag taaaataaga 240

aggcaatgct tgtggaatgt acagtgcata ttgacggcac acgcctcatt acgattcgcc 300

tgcttgcttc tcctgttcaa tcgtttcttt ggaaggcagt ggatttttct cttgcgtctc 360

tgtcttcttc agtttcgact tatcgaattt ctcgatctca gccatatcgg atttgtcaga 420

catggttacg gaggagaagc ggagcgaggc acgcgagaac gagcgaagtc tggccttcgt 480

caagatggtt ggacaccacg tcgccttcct ggaagcagac gtgcttcagg ctgagcggga 540

ccatggggcc ttccctcagg ccctgcggag gtggctggga tccgcagggc tcccctcctt 600

caggaacaag tcacctgcac cggtgcccgt gacgtacgag ctgcccacac tgtataggac 660

ggaggactat tttcctgtgg acgccgggga agcacagcac cacccccgca cctgccctcg 720

gcctttgtga gctttgtggt cttcccatca ggaacgctgg aaagtgacat tgtgtacaca 780

ctgcagcttg ggggtttttt ctttgtattg ctgtttattt tatattttaa aaatatttaa 840

aaaaatgtcg agatggggtc tcactatgtt gtccagactg atctcaaact cctgggctca 900

agtgatccac ccaccttggc cttccaaagt ggtgggatta tgggcaggag cctccgtgcc 960 

caggctgctg ccattttcaa atttcctccc tgcctcatgt gagaccacag ggtttggaga 1020 

agcagttgga acccacgtgt ggtgatgcct cccacatcgg cctgcttggg gttctacagg 1080 

ggttgaggga ccaggcctgg ccggggctga tggacagtgg ggactttcct tctctccatg 1140 

atggctttgc aggggctcca tggtccttct ctctgtgatg ggtttttgca cggggtgtgc 1200 

tctgccactg tggtgggtgg gtggatgctg cttctgttgc ctccagacct cggtgcccac 1260 

agccttgagg atccttccaa taaaggtgtc aagagctca 1299 

<210> 30 

<211> 2456 

<212> DNA 

<213> Homo sapien 

<400> 30 

gggttcacaa gcgagagcag ggggcacagc ttccacccgt cctagccagg gtgagaccac 60 

cctgcctgct ggtggagcca ggcgggccca gagcccacct ggaagatgga ggctgcacgg 120 

cactgcgtgg tccggggcag ccccagggca gcagagcatt ccctggcctt ccctgctggt 180 

gccagctcct taccacagag acgccgcgtg gaactcacta ctggcgatcg cggacgcccc 240 

aggaaggcga gtggcacgag gtgtggcggg gctgccacag gcagtcacgg agcacccgct 300

ggtgccaggc tcacccctca gatacagcct cgtgggctgg ctcagtggcg gccctgactc 360

tccaaggtct gaggtctgct gtgtcttcca ttccatgact cacaggaggt tcaaggttac 420

ttccactgtg gcagctgcct ccctcctccc tctccaggat ggaaggaggt cctgctgtgc 480 

aaacctgcct ggctgtcccc ttccgggaca cggacagggg gtttcctggc agttcctggt 540 

cctcccttga gggctaaggg tccccccgtt ctgtggccgc ctccagcaca tccacccaga 600 

gtccctgggc gtgaacacag ccgctgggga cgatcaccgc ctgcccagcg tgcagccctg 660 

gggctgcgtc cttatctccg ctcctcctgc ctcctgccca gctgttcaac gtctacccat 720

ggctcggggc cctgctccag ctgcaccggc ccgtcctgcg caagatcgag gaggtccgtg 780 

ccattctgag gaccctcctg gaggcgcgga ggccccacgt gtgcccgggg gaccccgtgt 840

gcagctatgt ggacgccctg atccagcagg gacaggggga tgaccccgag ggcctgtttg 900 

ctgaggccaa cgcggtggcc tgcaccctgg acatggtcat ggccgggacg gagacgacct 960 

cggccacgct gcagtgggcc gcacttctga tgggccggca cccggacgtg cagggtgaga 1020 

ccccggcgcc tggcgagacg gctccttctg cccccggggg acccccaggg acgagggatg 1080 

gcgctgccac ccaagcggcc caccctttgc cccaggccgg gtgcaggagg agctagaccg 1140

cgtgctgggc cctgggcgga ctccccggct ggaggaccag caggctctgc cctacacaag 1200

cgccgtgctc cacgaggtgc agcggttcat cacgctcctg ccgcacgtgc cccgctgcac 1260

cgcggccgac acacagctgg gcggcttcct gctccccaag ggcacgcccg tgattcccct 1320

gctgacctcg gtgctcctgg atgagacaca gtggcagacc ccaggccagt tcaaccccgg 1380

ccatttcctg gacgcgaatg ggcactttgt gaagcgggag gccttcctgc ctttctctgc 1440

aggccgccgc gtctgtgttg gggagcgcct ggccaggacc gagctcttcc tgctgtttgc 1500

cggcctcctg cagaggtacc gcctgctgcc cccgcctggc gtcagtccgg cctccctgga 1560

caccacgccc gcccgggctt ttaccatgag gccgagggcc caggccctgt gtgcggtgcc 1620

caggccctag gagctccccc agcccccagg tcctcctgac cactcccctc ccagccctgg 1680

gtcctcccac cctctctcct cccaccccac agctcggact gctctgggag ggccctgagg 1740

actcccaccc tcacccccac ccccacaggg tcagcaactg cttccggtta cacccaggac 1800

tacccctgcc cgaccctgtg ggacccccac ccctctgatg ctgtctgcag ctcagtccct 1860

gccagccccc aggagcgcct ccagggcccc gcccactctc ccacccctga agctgcactc 1920

ccacccacct agctcccccc agggcccccc agcacctaca gctggggctg cagggagaca 1980

acgggtggct gcatccagcc agagacaggc gcaggtgggt gtcctcagcg tgcgagccct 2040

gcacccccca ggtcctggga ctcctgcaga ccccactcca ttcccgctcc tggaacactt 2100

cctgcagctg tgcctggagg cagtcggcct gcagtgccag actctgagcc aagccactgg 2160

ggccatgcgt atgactggtg cagggaggca aggcccacat tctccttcag agacaggcac 2220

tggcgccaga ggcttccttg gggcgggggg agggcacctc agcccctgaa gacaagcagc 2280

actgcagtgg caaaaatgga aacactgacc cggtgcggtg gctcatgcct gtaatcccag 2340

cactttggga ggccgaggca ggcgaatcac gaggtcagga gttcgagacc agcctgccca 2400

acatggtgaa accctgtctc tactaaaaat acaaaaaaat taaccgagca tggtgg 2456



<210> 31 

<211> 642 

<212> DNA 

<213> Homo sapien 

<400> 31 

gggaaaggcc atgttccctc ctctgtgcca tgtccccacg tctcaggcct ttgtcaacga 60 

atccctcagc ccggacagag ggccgtggca tggctaatgg cttggagaag cagctgaggt 120 

gctgctttat ggcataaggt gagcgtcctt tgtcttggag tcctctgggc aggggacact 180 

tgtgcctggt gcccctggga ggaaggagag gagcatgtgc gggaaagagc aggaggccga 240 

ggtgggctga tcacgaggtc aggagttcga gaccagcctg gccaacatgg cacacatggc 300

acgcacatcg tggcgatgtc gcgtgccatg tgtctgatgc tgctgagcgt ggcgcgagca 360

ttcttgttga tggtggtccg cacataggag gtggcaggct tccgctggcc ggatctccgc 420

ttaatgacca ccacgacacc tttgccgtcg gctgccggct ccacgcccac agtcttgcgg 480

tgaatcagtc cgttgtagcg gaaggaattg cgggccctca agttattggg ctcagtgctg 540

tagtttttgg attatcattt taaaacatgg aaaactggag cagttccgca cgaccaatcc 600

agttgctaat gcctgagcat cggtacgcag acatacgcca gc 642



<210> 32 

<211> 3907 

<212> DNA 

<213> Homo sapien 

<400> 32 

gcgaattggg cctctagacg catgctctag caggcggcgc agtgtgctgg aaaggaccaa 60 

tcgacctgag gacttgcgag ccgctcagct cccgggacgt ttggagctgc tgctaaataa 120

tttctgctca gccatgtcgc cggctccaga tgcagccccg gctcctgcgt cgatctccct 180

gtttgacctc agcgcggatg ctccggtctt tcagggcctg agcctggtga gccacgcgcc 240 

tggggaggct ctggcccggg ctccgcgtac ttcctgttca ggctcagggg agagagaaag 300

cccagaaaga aagctactcc agggtcctat ggatatttca gagaagttat tttgttcaac 360 

ttgtgaccag accttccaga accaccaaga acagagggaa cattataagc ttgactggca 420

tcggtttaac ctaaagcaac gtctcaagga caagcctctc ctgtctgccc tggactttga 480 

aaagcagagc tccacaggag atctttccag catctcggga tcagaagact cagactcagc 540 

cagtgaggag gacttgcaga cactggatcg ggagagggct acatttgaga agttgagccg 600 

acccccaggc ttttaccctc atcgagttct tttccagaat gcccagggcc agtttcttta 660 

tgcctaccgc tgtgtcctag gccctcatca ggatccccca gaagaggcag aactgctgct 720 

acagaacctg caaagtagag gtcccagaga ctgcgtggtg ctcatggctg cagctgggca 780 

ctttgctggt gctatatttc aaggaagaga agtggtgaca cacaaaactt ttcaccgcta 840 

tacggttcgg gccaagcggg gcacagccca ggggcttcgg gatgcccgag gtgggccatc 900 

acactctgct ggagccaacc tgaggcgcta caatgaagcc acactatata aggatgttcg 960 

tgacctgctg gcagggccaa gctgggctaa ggcgctggag gaggctggta caatactgtt 1020 

gcgtgctccc cgctctggcc ggtctttgtt ctttggaggc aagggagcaq ccctgcaaag 1080 

gggggatccc cgactttggg atatccccct cgccacccgc agacccacct tccaagagct 1140 

acagcgtgtg ctccataagc tgaccacttt gcatgtctat gaagaagacc ctcgggaagc 1200 

agtcagactg cactcacctc agacacactg gaaaacagta agagaggaga gaaagaagcc 1260

tactgaggaa gaaataagaa agatctgcag ggatgaaaag gaagcgctgg ggcagaatga 1320

ggaatctccc aaacagggtt tgattactat ctggcaactg tcagatctga gtttctgtcc 1380

taaaaatgca ctggcaaatt ccctactctc ataactgacc catttcctta tttaaacaca 1440

cacccacagt gtctatctta gcaattactg agacatgttt agtgaccttt ccaaatctat 1500

tatctctttt gttatggatg tttccccaga gctaagaaaa tctttgtgtg gagcagatgc 1560

gtggatgtgt tgcatggatc tcacatagtt tatctttgat tggatgctgt ggtttgaggc 1620

tgagggctgc ttccatgctt tcctctattg tgcggcaccc taaggctact catacggaga 1680

gtccagggac ttgttcttgt ctgggctgag agcctcattt ctgcatgtga tataatggac 1740

tcagctcttg atgtgctact tttatctggg tgacaaaata aaaacaattt catttcaggc 1800

tttcattctt atgcatctgc ttctaccctc tgagtactcc ctagatggat ttcacatgtc 1860

aggtttttcc ctaggttcag ggtcggaggg agaagatggc tttcaggtag agttggagct 1920

agtggagttg actgtgggga ctctggatct ttgtgagtct gaagtattgc ccaagcggag 1980

gaggagaaaa aggaataaga aggagaaaag ccgagaccag gaggctgggg cacatcggac 2040

tcttctccag caaactcaag aagaggagcc ttccacacag tcatcccagg cagttgctgc 2100

ccccttgggc cctttgctgg atgaggccaa agcccctggt cagccagagc tctggaatgc 2160

actgcttgct gcttgccgag ctggagatgt tggagtgcta aagctgcagc tagctcccag 2220

ccctgcagac cctagagttc tgtctctgct cagtgccccc ttgggctccg gtggctttac 2280

tctcctgcat gcagcagctg cagctggaag aggctcagtg gttcgtctgc tgctggaagc 2340

aggtgctgac cccactgtgc agtgagtaaa ggtccccatc ctgagtcatt ttgggtatag 2400

agaggataat ttggaggtta aagttggcat tggctacttg gcaatattcc cttggtctgg 2460

ttgggtgatg ttgggaaagg caggaggcaa gttaaaccca tttttggagt ttttgcacct 2520

agggactctc gggcccggcc accttatact gttgcggctg acaaatcaac acgtaatgag 2580

ttccgaaggt tcatggagaa gaatccagat gcctacgatt acaacaaggc tcaggtcatc 2640

tggaatagga aggacaagca gagtgggaag gcatcacaga tcctggctag atcctttcta 2700

cttcttgcag ttttaccaaa gataacttat agggtcttat atttgaaatc ctccaggtgc 2760

caggaccatt gacaccagaa atggaggcac ggcaggctac acggaaaagg gagcagaagg 2820

cagcccggcg gcaacgggag gaacagcagc agaggcagca ggagcaggag gagcgtgaac 2880

gagaagagca gcggcgattt gccgccctca gtgaccgaga gaagagagct ctggctgcag 2940

agcgccgact cgctgcccag ttgggagccc ctacctctcc aatccctgac tctgcaatcg 3000

tcaatactcg acgctgctgg agttgtgggg catccctcca aggcctgact ccctttcact 3060

           ctctttctgc tccacacgtt gcctccagga tcatcgccgt caggcaggga 3120

ggccctcttc ctgatctctt acagctctac ctggggccaa ctcagggacc tgagagggca 3180

cattcacagc agccctaggt tttttcttcc ccgtgaaacc agagatgatt tggaagatgg 3240

gggtgaagga cactcgggaa ctagggcaaa gacagggcta gaggtatgtg gagctggtac 3300

tgtctctgga attttaatca caataaagtt tggcaaggaa tgtgtacttg tacttacatt 3360

cagaggcact gtggccttta gttccttagg gtcaccgtgg ccagcaccaa gggatcctgc 3420

ccactccatg tcccaggtcc aagggttact tttttttttt tttttttttt ttgagatgga 3480

gtgtctcact gtcagccagg ctggaagcga gactctgtct caaaaaacac aaacaaacaa 3540

acaaaagaaa tgtaaaatct cagatcctac ctcagaccta tagaatcaga atttgtattt 3600

taacaagatt cccaggtgat cctaggcacc ttcaaaattg agaagcatgg ctttacttga 3660

ctgggctccc tgaggacagg gagtgccctg ttcatctcaa atcctcagcc tctagcacag 3720

ggccgggtat atagtaggtt aaatctggat gaatcaatgt cataaccaaa aggctaaaac 3780

aggttcagaa ccaggggaaa ccattggaaa atcttcaggg tgatccgggg tgtggaaggg 3840

aggaaccagg tcacaagaag gacctctttc ctggagagtc ctgtttcagc aaaacagcca 3900

ttccctc 3907



<210> 33 

<211> 3608 

<212> DNA 

<213> Homo sapien 

<400> 33 

gcgaattggg cctctagacg catgctctag caggcggcgc agtgtgctgg aaaggaccaa 60

tcgacctgag gacttgcgag ccgctcagct cccgggacgt ttggagctgc tgctaaataa 120

tttctgctca gccatgtcgc cggctccaga tgcagccccg gctcctgcgt cgatctccct 180

gtttgacctc agcgcggatg ctccggtctt tcagggcctg agcctggtga gccacgcgcc 240

tggggaggct ctggcccggg ctccgcgtac ttcctgttca ggctcagggg agagagaaag 300

cccagaaaga aagctactcc agggtcctat ggatatttca gagaagttat tttgttcaac 360

ttgtgaccag accttccaga accaccaaga acagagggaa cattataagc ttgactggca 420

tcggtttaac ctaaagcaac gtctcaagga caagcctctc ctgtctgccc tggactttga 480

aaagcagagc tccacaggag atctttccag catctcggga tcagaagact cagactcagc 540

cagtgaggag gacttgcaga cactggatcg ggagagggct acatttgaga agttgagccg 600

acccccaggc ttttaccctc atcgagttct tttccagaat gcccagggcc agtttcttta 660

tgcctaccgc tgtgtcctag gccctcatca ggatccccca gaagaggcag aactgctgct 720

acagaacctg caaagtagag gtcccagaga ctgcgtggtg ctcatggctg cagctgggca 780

ctttgctggt gctatatttc aaggaagaga agtggtgaca cacaaaactt ttcaccgcta 840

tacggttcgg gccaagcggg gcacagccca ggggcttcgg gatgcccgag gtgggccatc 900 

acactctgct ggagccaacc tgaggcgcta caatgaagcc acactatata aggatgttcg 960 

tgacctgctg gcagggccaa gctgggctaa ggcgctggag gaggctggta caatactgtt 1020 

gcgtgctccc cgctctggcc ggtctttgtt ctttggaggc aagggagcac ccctgcaaag 1080 

gggggatccc cgactttggg atatccccct cgccacccgc agacccacct tccaagagct 1140 

acagcgtgtg ctccataagc tgaccacttt gcatgtctat gaagaagacc ctcgggaagc 1200 

agtcagactg cactcacctc agacacactg gaaaacagta agagaggaga gaaagaagcc 1260 

tactgaggaa gaaataagaa agatctgcag ggatgaaaag gaagcgctgg ggcagaatga 1320 

ggaatctccc aaacagggtt tgattactat ctggcaactg tcagatctga gtttctgtcc 1380

taaaaatgca ctggcaaatt ccctactctc ataactgacc catttcctta tttaaacaca 1440 

cacccacagt gtctatctta gcaattactg agacatgttt agtgaccttt ccaaatctat 1500 

tatctctttt gttatggatg tttccccaga gctaagaaaa tctttgtgtg gagcagatgc 1560

gtggatgtgt tgcatggatc tcacatagtt tatctttgat tggatgctgt ggtttgaggc 1620

tgagggctgc ttccatgctt tcctctattg tgcggcaccc taaggctact catacggaga 1680 

gtccagggac ttgttcttgt ctgggctgag agcctcattt ctgcatgtga tataatggac 1740 

tcagctcttg atgtgctact tttatctggg tgacaaaata aaaacaattt catttcaggc 1800 

tttcattctt atgcatctgc ttctaccctc tgagtactcc ctagatggat ttcacatgtc 1860 

aggtttttcc ctaggttcag ggtcggaggg agaagatggc tttcaggtag agttggagct 1920 

agtggagttg actgtgggga ctctggatct ttgtgagtct gaagtattgc ccaagcggag 1980 

gaggagaaaa aggaataaga aggagaaaag ccgagaccag gaggctgggg cacatcggac 2040

tcttctccag caaactcaag aagaggagcc ttccacacag tcatcccagg cagttgctgc 2100 

ccccttgggc cctttgctgg atgaggccaa agcccctggt cagccagagc tctggaatgc 2160 

actgcttgct gcttgccgag ctggagatgt tggagtgcta aagctgcagc tagctcccag 2220

ccctgcagac cctagagttc tgtctctgct cagtgccccc ttgggctccg gtggctttac 2280 

tctcctgcat gcagcagctg cagctggaag aggctcagtg gttcgtctgc tgctggaagc 2340 

aggtgctgac cccactgtgc agtgagtaaa ggtccccatc ctgagtcatt ttgggtatag 2400 

agaggataat ttggaggtta aagttggcat tggctacttg gcaatattcc cttggtctgg 2460 

ttgggtgatg ttgggaaagg caggaggcaa gttaaaccca tttttggagt ttttgcacct 2520 

agggactctc gggcccggcc accttatact gttgcggctg acaaatcaac acgtaatgag 2580

ttccgaaggt tcatggagaa gaatccagat gcctacgatt acaacaaggc tcaggtgcca 2640

ggaccattga caccagaaat ggaggcacgg caggctacac ggaaaaggga gcagaaggca 2700

gcccggcggc aacgggagga acagcagcag aggcagcagg agcaggagga gcgtgaacga 2760

gaagagcagc ggcgatttgc cgccctcagt gaccgagaga agagagctct ggctgcagag 2820

cgccgactcg ctgcccagtt gggagcccct acctctccaa tccctgactc tgcaatcgtc 2880

aatactcgac gctgctggag ttgtggggca tccctccaag gcctgactcc ctttcactac 2940

ctcgacttct ctttctgctc cacacgttgc ctccaggatc atcgccgtca ggcagggagg 3000

ccctcttcct gatctcttac agctctacct ggggccaact cagggacctg agagggcaca 3060

ttcacagcag ccctaggttt tttcttcccc gtgaaaccag agatgatttg gaagatgggg 3120

gtgaaggaca ctcgggaact agggcaaaga cagggctaga ggtatgtgga gctggtactg 3180

tctctggaat tttaatcaca ataaagtttg gcaaggaatg tgtacttgta cttacattca 3240

gaggcactgt ggcctttagt tccttagggt caccgtggcc agcaccaagg gatcctgccc 3300

actccatgtc ccaggtccaa gggttacttt tttttttttt tttttttttt gagatggagt 3360

gtctcactgt cagccaggct ggagtgcagt ggtacaacct ccactcactg caacctctgc 3420

ctcctggatt caagcaattc tcctgccctc agcctcccaa gtagctggga ttagaggtgc 3480

ccaccatcac gcccggctaa tttttgtatt tttttgagat ggagtgtctc actgtcagcc 3540

aggctggagt gcagtggcac aatctcggct cactgcaacc tctgcctccg agattcaagc 3600

gattctcc 3608


<210> 34 

<211> 3550 

<212> DNA 

<213> Homo sapien 












<400> 34 
gcgaattggg cctctagacg catgctctag caggcggcgc agtgtgctgg aaaggaccaa 60

tcgacctgag gacttgcgag ccgctcagct cccgggacgt ttggagctgc tgctaaataa 120

tttctgctca gccatgtcgc cggctccaga tgcagccccg gctcctgcgt cgatctccct 180

gtttgacctc agcgcggatg ctccggtctt tcagggcctg agcctggtga gccacgcgcc 240

tggggaggct ctggcccggg ctccgcgtac ttcctgttca ggctcagggg agagagaaag 300

cccagaaaga aagctactcc agggtcctat ggatatttca gagaagttat tttgttcaac 360

ttgtgaccag accttccaga accaccaaga acagagggaa cattataagc ttgactggca 420

tcggtttaac ctaaagcaac gtctcaagga caagcctctc ctgtctgccc tggactttga 480

aaagcagagc tccacaggag atctttccag catctcggga tcagaagact cagactcagc 540

cagtgaggag gacttgcaga cactggatcg ggagagggct acatttgaga agttgagccg 600

acccccaggc ttttaccctc atcgagttct tttccagaat gcccagggcc agtttcttta 660

tgcctaccgc tgtgtcctag gccctcatca ggatccccca gaagaggcag aactgctgct 720 

acagaacctg caaagtagag gtcccagaga ctgcgtggtg ctcatggctg cagctgggca 780 

ctttgctggt gctatatttc aaggaagaga agtggtgaca cacaaaactt ttcaccgcta 840 

tacggttcgg gccaagcggg gcacagccca ggggcttcgg gatgcccgag gtgggccatc 900 

acactctgct ggagccaacc tgaggcgcta caatgaagcc acactatata aggatgttcg 960 

tgacctgctg gcagggccaa gctgggctaa ggcgctggag gaggctggta caatactgtt 1020 

gcgtgctccc cgctctggcc ggtctttgtt ctttggaggc aagggagcac ccctgcaaag 1080 

gggggatccc cgactttggg atatccccct cgccacccgc agacccacct tccaagagct 1140 

acagcgtgtg ctccataagc tgaccacttt gcatgtctat gaagaagacc ctcgggaagc 1200 

agtcagactg cactcacctc agacacactg gaaaacagta agagaggaga gaaagaagcc 1260 

tactgaggaa gaaataagaa agatctgcag ggatgaaaag gaagcgctgg ggcagaatga 1320

ggaatctccc aaacagggtt tgattactat ctggcaactg tcagatctga gtttctgtcc 1380

taaaaatgca ctggcaaatt ccctactctc ataactgacc catttcctta tttaaacaca 1440

cacccacagt gtctatctta gcaattactg agacatgttt agtgaccttt ccaaatctat 1500 

tatctctttt gttatggatg tttccccaga gctaagaaaa tctttgtgtg gagcagatgc 1560 

gtggatgtgt tgcatggatc tcacatagtt tatctttgat tggatgctgt ggtttgaggc 1620 

tgagggctgc ttccatgctt tcctctattg tgcggcaccc taaggctact catacggaga 1680

gtccagggac ttgttcttgt ctgggctgag agcctcattt ctgcatgtga tataatggac 1740 

tcagctcttg atgtgctact tttatctggg tgacaaaata aaaacaattt catttcaggc 1800 

tttcattctt atgcatctgc ttctaccctc tgagtactcc ctagatggat ttcacatgtc 1860 

aggtttttcc ctaggttcag ggtcggaggg agaagatggc tttcaggtag agttggagct 1920 

agtggagttg actgtgggga ctctggatct ttgtgagtct gaagtattgc ccaagcggag 1980 

gaggagaaaa aggaataaga aggagaaaag ccgagaccag gaggctgggg cacatcggac 2040

tcttctccag caaactcaag aagaggagcc ttccacacag tcatcccagg cagttgctgc 2100 

ccccttgggc cctttgctgg atgaggccaa agcccctggt cagccagagc tctggaatgc 2160 

actgcttgct gcttgccgag ctggagatgt tggagtgcta aagctgcagc tagctcccag 2220 

ccctgcagac cctagagttc tgtctctgct cagtgccccc ttgggctccg gtggctttac 2280 

tctcctgcat gcagcagctg cagctggaag aggctcagtg gttcgtctgc tgctggaagc 2340

agggactctc gggcccggcc accttatact gttgcggctg acaaatcaac acgtaatgag 2400

ttccgaaggt tcatggagaa gaatccagat gcctacgatt acaacaaggc tcaggtcatc 2460 

tggaatagga aggacaagca gagtgggaag gcatcacaga tcctggctag atcctttcta 2520 

cttcttgcag ttttaccaaa gataacttat agggtcttat atttgaaatc ctccaggtgc 2580 

caggaccatt gacaccagaa atggaggcac ggcaggctac acggaaaagg gagcagaagg 2640 

cagcccggcg gcaacgggag gaacagcagc agaggcagca ggagcaggag gagcgtgaac 2700 

gagaagagca gcggcgattt gccgccctca gtgaccgaga gaagagagct ctggctgcag 2760

agcgccgact cgctgcccag ttgggagccc ctacctctcc aatccctgac tctgcaatcg 2820 

tcaatactcg acgctgctgg agttgtgggg catccctcca aggcctgact ccctttcact 2880

acctcgactt ctctttctgc tccacacgtt gcctccagga tcatcgccgt caggcaggga 2940 

ggccctcttc ctgatctctt acagctctac ctggggccaa ctcagggacc tgagagggca 3000 

cattcacagc agccctaggt tttttcttcc ccgtgaaacc agagatgatt tggaagatgg 3060

gggtgaagga cactcgggaa ctagggcaaa gacagggcta gaggtatgtg gagctggtac 3120 

tgtctctgga attttaatca caataaagtt tggcaaggaa tgtgtacttg tacttacatt 3180 

cagaggcact gtggccttta gttccttagg gtcaccgtgg ccagcaccaa gggatcctgc 3240 

ccactccatg tcccaggtcc aagggttact tttttttttt tttttttttt ttgagatgga 3300

gtgtctcact gtcagccagg ctggagtgca gtggtacaac ctccactcac tgcaacctct 3360

gcctcctgga ttcaagcaat tctcctgccc tcagcctccc aagtagctgg gattagaggt 3420 

gcccaccatc acgcccggct aatttttgta tttttttgag atggagtgtc tcactgtcag 3480 

ccaggctgga gtgcagtggc acaatctcgg ctcactgcaa cctctgcctc cgagattcaa 3540 

gcgattctcc 3550 

<210> 35 

<211> 3562 

<212> DNA 

<213> Homo sapien 

<400> 35 

gcgaattggg cctctagacg catgctctag caggcggcgc agtgtgctgg aaaggaccaa 60 

tcgacctgag gacttgcgag ccgctcagct cccgggacgt ttggagctgc tgctaaataa 120 

tttctgctca gccatgtcgc cggctccaga tgcagccccg gctcctgcgt cgatctccct 180 

gtttgacctc agcgcggatg ctccggtctt tcagggcctg agcctgaggg aacattataa 240 

gcttgactgg catcggttta acctaaagca acgtctcaag gacaagcctc tcctgtctgc 300 

cctggacttt gaaaagcaga gctccacagg agatctttcc agcatctcgg gatcagaaga 360

ctcagactca gccagtgagg aggacttgca gacactggat cgggagaggg ctacatttga 420

gaagttgagc cgacccccag gcttttaccc tcatcgagtt cttttccaga atgcccaggg 480 

ccagtttctt tatgcctacc gctgtgtcct aggccctcat caggatcccc cagaagaggc 540 

agaactgctg ctacagaacc tgcaaagtag aggtcccaga gactgcgtgg tgctcatggc 600 

tgcagctggg cactttgctg gtgctatatt tcaaggaaga gaagtggtga cacacaaaac 660 

ttttcaccgc tatacggttc gggccaagcg gggcacagcc caggggcttc gggatgcccg 720 

aggtgggcca tcacactctg ctggagccaa cctgaggcgc tacaatgaag ccacactata 780 

taaggatgtt cgtgacctgc tggcagggcc aagctgggct aaggcgctgg aggaggctgg 840 

tacaatactg ttgcgtgctc cccgctctgg ccggtctttg ttctttggag gcaagggagc 900 

acccctgcaa aggggggatc cccgactttg ggatatcccc ctcgccaccc gcagacccac 960 

cttccaagag ctacagcgtg tgctccataa gctgaccact ttgcatgtct atgaagaaga 1020 

ccctcgggaa gcagtcagac tgcactcacc tcagacacac tggaaaacag taagagagga 1080 

gagaaagaag cctactgagg aagaaataag aaagatctgc agggatgaaa aggaagcgct 1140

ggggcagaat gaggaatctc ccaaacaggg tttgattact atctggcaac tgtcagatct 1200 

gagtttctgt cctaaaaatg cactggcaaa ttccctactc tcataactga cccatttcct 1260 

tatttaaaca cacacccaca gtgtctatct tagcaattac tgagacatgt ttagtgacct 1320 

ttccaaatct attatctctt ttgttatgga tgtttcccca gagctaagaa aatctttgtg 1380 

tggagcagat gcgtggatgt gttgcatgga tctcacatag tttatctttg attggatgct 1440

gtggtttgag gctgagggct gcttccatgc tttcctctat tgtgcggcac cctaaggcta 1500 

ctcatacgga gagtccaggg acttgttctt gtctgggctg agagcctcat ttctgcatgt 1560 

gatataatgg actcagctct tgatgtgcta cttttatctg ggtgacaaaa taaaaacaat 1620 

ttcatttcag gctttcattc ttatgcatct gcttctaccc tctgagtact ccctagatgg 1680

atttcacatg tcaggttttt ccctaggttc agggtcggag ggagaagatg gctttcaggt 1740 

agagttggag ctagtggagt tgactgtggg gactctggat ctttgtgagt ctgaagtatt 1800

gcccaagcgg aggaggagaa aaaggaataa gaaggagaaa agccgagacc aggaggctgg 1860 

ggcacatcgg actcttctcc agcaaactca agaagaggag ccttccacac agtcatccca 1920 

ggcagttgct gcccccttgg gccctttgct ggatgaggcc aaagcccctg gtcagccaga 1980 

gctctggaat gcactgcttg ctgcttgccg agctggagat gttggagtgc taaagctgca 2040 

gctagctccc agccctgcag accctagagt tctgtctctg ctcagtgccc ccttgggctc 2100 

cggtggcttt actctcctgc atgcagcagc tgcagctgga agaggctcag tggttcgtct 2160 

gctgctggaa gcaggtgctg accccactgt gcagtgagta aaggtcccca tcctgagtca 2220

ttttgggtat agagaggata atttggaggt taaagttggc attggctact tggcaatatt 2280

cccttggtct ggttgggtga tgttgggaaa ggcaggaggc aagttaaacc catttttgga 2340

gtttttgcac ctagggactc tcgggcccgg ccaccttata ctgttgcggc tgacaaatca 2400

acacgtaatg agttccgaag gttcatggag aagaatccag atgcctacga ttacaacaag 2460

gctcaggtca tctggaatag gaaggacaag cagagtggga aggcatcaca gatcctggct 2520

agatcctttc tacttcttgc agttttacca aagataactt atagggtctt atatttgaaa 2580

tcctccaggt gccaggacca ttgacaccag aaatggaggc acggcaggct acacggaaaa 2640

gggagcagaa ggcagcccgg cggcaacggg aggaacagca gcagaggcag caggagcagg 2700

aggagcgtga acgagaagag cagcggcgat ttgccgccct cagtgaccga gagaagagag 2760

ctctggctgc agagcgccga ctcgctgccc agttgggagc ccctacctct ccaatccctg 2820

actctgcaat cgtcaatact cgacgctgct ggagttgtgg ggcatccctc caaggcctga 2880

ctccctttca ctacctcgac ttctctttct gctccacacg ttgcctccag gatcatcgcc 2940

gtcaggcagg gaggccctct tcctgatctc ttacagctct acctggggcc aactcaggga 3000

cctgagaggg cacattcaca gcagccctag gttttttctt ccccgtgaaa ccagagatga 3060

tttggaagat gggggtgaag gacactcggg aactagggca aagacagggc tagaggtatg 3120

tggagctggt actgtctctg gaattttaat cacaataaag tttggcaagg aatgtgtact 3180

tgtacttaca ttcagaggca ctgtggcctt tagttcctta gggtcaccgt ggccagcacc 3240

aagggatcct gcccactcca tgtcccaggt ccaagggtta cttttttttt tttttttttt 3300

ttttgagatg gagtgtctca ctgtcagcca ggctggagtg cagtggtaca acctccactc 3360

actgcaacct ctgcctcctg gattcaagca attctcctgc cctcagcctc ccaagtagct 3420

gggattagag gtgcccacca tcacgcccgg ctaatttttg tatttttttg agatggagtg 3480

tctcactgtc agccaggctg gagtgcagtg gcacaatctc ggctcactgc aacctctgcc 3540

tccgagattc aagcgattct cc 3562



<210> 36 

<211> 4502 

<212> DNA 

<213> Homo sapien 

<400> 36 

gcgaattggg cctctagacg catgctctag caggcggcgc agtgtgctgg aaaggaccaa 60

tcgacctgag gacttgcgag ccgctcagct cccgggacgt ttggagctgc tgctaaataa 120

tttctgctca gccatgtcgc cggctccaga tgcagccccg gctcctgcgt cgatctccct 180

gtttgacctc agcgcggatg ctccggtctt tcagggcctg agcctggtga gccacgcgcc 240

tggggaggct ctggcccggg ctccgcgtac ttcctgttca ggctcagggg agagagaaag 300

cccagaaaga aagctactcc agggtcctat ggatatttca gagaagttat tttgttcaac 360

ttgtgaccag accttccaga accaccaaga acagagggaa cattataagc ttgactggca 420

tcggtttaac ctaaagcaac gtctcaagga caagcctctc ctgtctgccc tggactttga 480

aaagcagagc tccacaggag atctttccag catctcggga tcagaagact cagactcagc 540

cagtgaggag gacttgcaga cactggatcg ggagagggct acatttgaga agttgagccg 600

acccccaggc ttttaccctc atcgagttct tttccagaat gcccagggcc agtttcttta 660

tgcctaccgc tgtgtcctag gccctcatca gaggcaagtg acagtacagg ttgcatggct 720

aaccccagcc ttttgtacac ccagcctaga tttcccttag cctagtgcat ttctccctgc 780

ttctcttcct tcctgttggt tgtgggtatc acgcttgaca acatagtctt gaattgaaac 840

tggcctgaat tgtggaaggg ttaagtgtaa tggaattaag tttggaaaca aaccaattta 900

ggtttgaatt gtgattctac cacttcatct tgactgcctc gccaagcctc aaatttcctt 960

gtttgtcaaa tggaaacaag cctcagggtg gtaatattaa atgagataat gtttgtgaaa 1020

attggccctt agtttcttgc tctcagtgaa tgcaacagat atgtttctga tgcttctctg 1080

tcatcaagga tcccccagaa gaggcagaac tgctgctaca gaacctgcaa agtagaggtc 1140

ccagagactg cgtggtgctc atggctgcag ctgggcactt tgctggtgct atatttcaag 1200

ggtgagaggg tgctgcatgg gggtaaggat ggagtggatc ctgttagcca gggagcacca 1260

gctggtctcc agtactgagt ctgtgctgtc tacagaagag aagtggtgac acacaaaact 1320

tttcaccgct atacggttcg ggccaagcgg ggcacagccc aggggcttcg ggatgcccga 1380

ggtgggccat cacactctgc tggagccaac ctgaggcgct acaatgaagc cacactatat 1440

aaggtgagtt aagcctttta gatctggtgg tactgatcca tacttttttt gcatctctgt 1500

tcaactcact gttaagtagg gaggtttttg actcatatcc ttccaatttt cattataggt 1560

agacaaaggt agatgaggga ttccccagga gccttagtcc tctattcttt ctccgtgtta 1620

tcctctactt tccacccctt gcacttctgt gggccatttt ttttagttgg agatttataa 1680

atagcaagtg aaggtatgca ggactgagtt actctcttct ctaggatgtt cgtgacctgc 1740

tggcagggcc aagctgggct aaggcgctgg aggaggctgg tacaatactg ttgcgtgctc 1800

cccgctctgg ccggtctttg ttctttggag gcaagggagc acccctgcaa aggggggatc 1860

cccgactttg ggatatcccc ctcgccaccc gcagacccac cttccaagag ctacagcgtg 1920

tgctccataa gctgaccact ttgcatgtct atgaagaaga ccctcgggaa gcagtcagac 1980

tgcactcacc tcagacacac tggaaaacag taagagagga gagaaagaag cctactgagg 2040

aagaaataag aaagatctgc agggatgaaa aggaagcgct ggggcagaat gaggaatctc 2100

ccaaacaggg tttgattact atctggcaac tgtcagatct gagtttctgt cctaaaaatg 2160 

cactggcaaa ttccctactc tcataactga cccatttcct tatttaaaca cacacccaca 2220 

gtgtctatct tagcaattac tgagacatgt ttagtgacct ttccaaatct attatctctt 2280 

ttgttatgga tgtttcccca gagctaagaa aatctttgtg tggagcagat gcgtggatgt 2340

gttgcatgga tctcacatag tttatctttg attggatgct gtggtttgag gctgagggct 2400 

gcttccatgc tttcctctat tgtgcggcac cctaaggcta ctcatacgga gagtccaggg 2460 

acttgttctt gtctgggctg agagcctcat ttctgcatgt gatataatgg actcagctct 2520 

tgatgtgcta cttttatctg ggtgacaaaa taaaaacaat ttcatttcag gctttcattc 2580 

ttatgcatct gcttctaccc tctgagtact ccctagatgg atttcacatg tcaggttttt 2640 

ccctaggttc agggtcggag ggagaagatg gctttcaggt agagttggag ctagtggagt 2700 

tgactgtggg gactctggat ctttgtgagt ctgaagtatt gcccaagcgg aggaggagaa 2760

aaaggaataa gaaggagaaa agccgagacc aggaggctgg ggcacatcgg actcttctcc 2820 

agcaaactca agaagaggag ccttccacac agtcatccca ggcagttgct gcccccttgg 2880 

gccctttgct ggatgaggcc aaagcccctg gtcagccaga gctctggaat gcactgcttg 2940 

ctgcttgccg agctggagat gttggagtgc taaagctgca gctagctccc agccctgcag 3000 

accctagagt tctgtctctg ctcagtgccc ccttgggctc cggtggcttt actctcctgc 3060 

atgcagcagc tgcagctgga agaggctcag tggttcgtct gctgctggaa gcaggtgctg 3120

accccactgt gcagtgagta aaggtcccca tcctgagtca ttttgggtat agagaggata 3180 

atttggaggt taaagttggc attggctact tggcaatatt cccttggtct ggttgggtga 3240 

tgttgggaaa ggcaggaggc aagttaaacc catttttgga gtttttgcac ctagggactc 3300 

tcgggcccgg ccaccttata ctgttgcggc tgacaaatca acacgtaatg agttccgaag 3360 

gttcatggag aagaatccag atgcctacga ttacaacaag gctcaggtca tctggaatag 3420 

gaaggacaag cagagtggga aggcatcaca gatcctggct agatcctttc tacttcttgc 3480 

agttttacca aagataactt atagggtctt atatttgaaa tcctccaggt gccaggacca 3540 

ttgacaccag aaatggaggc acggcaggct acacggaaaa gggagcagaa ggcagcccgg 3600 

cggcaacggg aggaacagca gcagaggcag caggagcagg aggagcgtga acgagaagag 3660 

cagcggcgat ttgccgccct cagtgaccga gagaagagag ctctggctgc agagcgccga 3720 

ctcgctgccc agttgggagc ccctacctct ccaatccctg actctgcaat cgtcaatact 3780 

cgacgctgct ggagttgtgg ggcatccctc caaggcctga ctccctttca ctacctcgac 3840 

ttctctttct gctccacacg ttgcctccag gatcatcgcc gtcaggcagg gaggccctct 3900
tcctgatctc ttacagctct acctggggcc aactcaggga cctgagaggg cacattcaca 3960
gcagccctag gttttttctt ccccgtgaaa ccagagatga tttggaagat gggggtgaag 4020
gacactcggg aactagggca aagacagggc tagaggtatg tggagctggt actgtctctg 4080
gaattttaat cacaataaag tttggcaagg aatgtgtact tgtacttaca ttcagaggca 4140
ctgtggcctt tagttcctta gggtcaccgt ggccagcacc aagggatcct gcccactcca 4200
tgtcccaggt ccaagggtta cttttttttt tttttttttt ttttgagatg gagtgtctca 4260
ctgtcagcca ggctggagtg cagtggtaca acctccactc actgcaacct ctgcctcctg 4320
gattcaagca attctcctgc cctcagcctc ccaagtagct gggattagag gtgcccacca 4380
tcacgcccgg ctaatttttg tatttttttg agatggagtg tctcactgtc agccaggctg 4440
gagtgcagtg gcacaatctc ggctcactgc aacctctgcc tccgagattc aagcgattct 4500
cc 4502


<210> 37 

<211> 3316 

<212> DNA 

<213> Homo sapien 












<400> 37 
gcgaattggg cctctagacg catgctctag caggcggcgc agtgtgctgg aaaggaccaa 60
tcgacctgag gacttgcgag ccgctcagct cccgggacgt ttggagctgc tgctaaataa 120
tttctgctca gccatgtcgc cggctccaga tgcagccccg gctcctgcgt cgatctccct 180
gtttgacctc agcgcggatg ctccggtctt tcagggcctg agcctggtga gccacgcgcc 240
tggggaggct ctggcccggg ctccgcgtac ttcctgttca ggctcagggg agagagaaag 300
cccagaaaga aagctactcc agggtcctat ggatatttca gagaagttat tttgttcaac 360
ttgtgaccag accttccaga accaccaaga acagagggaa cattataagc ttgactggca 420
tcggtttaac ctaaagcaac gtctcaagga caagcctctc ctgtctgccc tggactttga 480
aaagcagagc tccacaggag atctttccag catctcggga tcagaagact cagactcagc 540
cagtgaggag gacttgcaga cactggatcg ggagagggct acatttgaga agttgagccg 600
acccccaggc ttttaccctc atcgagttct tttccagaat gcccagggcc agtttcttta 660
tgcctaccgc tgtgtcctag gccctcatca ggatccccca gaagaggcag aactgctgct 720
acagaacctg caaagtagag gtcccagaga ctgcgtggtg ctcatggctg cagctgggca 780
ctttgctggt gctatatttc aaggaagaga agtggtgaca cacaaaactt ttcaccgcta 840
tacggttcgg gccaagcggg gcacagccca ggggcttcgg gatgcccgag gtgggccatc 900
acactctgct ggagccaacc tgaggcgcta caatgaagcc acactatata aggatgttcg 960
tgacctgctg gcagggccaa gctgggctaa ggcgctggag gaggctggta caatactgtt 1020
gcgtgctccc cgctctggcc ggtctttgtt ctttggaggc aagggagcac ccctgcaaag 1080
gggggatccc cgactttggg atatccccct cgccacccgc agacccacct tccaagagct 1140
acagcgtgtg ctccataagc tgaccacttt gcatgtctat gaagaagacc ctcgggaagc 1200
agtcagactg cactcacctc agacacactg gaaaacagta agagaggaga gaaagaagcc 1260
tactgaggaa gaaataagaa agatctgcag ggatgaaaag gaagcgctgg ggcagaatga 1320
ggaatctccc aaacagggtt tgattactat ctggcaactg tcagatctga gtttctgtcc 1380
taaaaatgca ctggcaaatt ccctactctc ataactgacc catttcctta tttaaacaca 1440
cacccacagt gtctatctta gcaattactg agacatgttt agtgaccttt ccaaatctat 1500
tatctctttt gttatggatg tttccccaga gctaagaaaa tctttgtgtg gagcagatgc 1560
gtggatgtgt tgcatggatc tcacatagtt tatctttgat tggatgctgt ggtttgaggc 1620
tgagggctgc ttccatgctt tcctctattg tgcggcaccc taaggctact catacggaga 1680
gtccagggac ttgttcttgt ctgggctgag agcctcattt ctgcatgtga tataatggac 1740
tcagctcttg atgtgctact tttatctggg tgacaaaata aaaacaattt catttcaggc 1800
tttcattctt atgcatctgc ttctaccctc tgagtactcc ctagatggat ttcacatgtc 1860
aggtttttcc ctaggttcag ggtcggaggg agaagatggc tttcaggtag agttggagct 1920
agtggagttg actgtgggga ctctggatct ttgtgagtct gaagtattgc ccaagcggag 1980
gaggagaaaa aggaataaga aggagaaaag ccgagaccag gaggctgggg cacatcggac 2040
tcttctccag caaactcaag aagaggagcc ttccacacag tcatcccagg cagttgctgc 2100
ccccttgggc cctttgctgg atgaggccaa agcccctggt cagccagagc tctggaatgc 2160
actgcttgct gcttgccgag ctggagatgt tggagtgcta aagctgcagc tagctcccag 2220
ccctgcagac cctagagttc tgtctctgct cagtgccccc ttgggctccg gtggctttac 2280
tctcctgcat gcagcagctg cagctggaag aggctcagtg gttcgtctgc tgctggaagc 2340
aggtgccagg accattgaca ccagaaatgg aggcacggca ggctacacgg aaaagggagc 2400
agaaggcagc ccggcggcaa cgggaggaac agcagcagag gcagcaggag caggaggagc 2460
gtgaacgaga agagcagcgg cgatttgccg ccctcagtga ccgagagaag agagctctgg 2520
ctgcagagcg ccgactcgct gcccagttgg gagcccctac ctctccaatc cctgactctg 2580
caatcgtcaa tactcgacgc tgctggagtt gtggggcatc cctccaaggc ctgactccct 2640
ttcactacct cgacttctct ttctgctcca cacgttgcct ccaggatcat cgccgtcagg 2700
cagggaggcc ctcttcctga tctcttacag ctctacctgg ggccaactca gggacctgag 2760
agggcacatt cacagcagcc ctaggttttt tcttccccgt gaaaccagag atgatttgga 2820
agatgggggt gaaggacact cgggaactag ggcaaagaca gggctagagg tatgtggagc 2880
tggtactgtc tctggaattt taatcacaat aaagtttggc aaggaatgtg tacttgtact 2940
tacattcaga ggcactgtgg cctttagttc cttagggtca ccgtggccag caccaaggga 3000
tcctgcccac tccatgtccc aggtccaagg gttacttttt tttttttttt ttttttttga 3060
gatggagtgt ctcactgtca gccaggctgg agtgcagtgg tacaacctcc actcactgca 3120
acctctgcct cctggattca agcaattctc ctgccctcag cctcccaagt agctgggatt 3180
agaggtgccc accatcacgc ccggctaatt tttgtatttt tttgagatgg agtgtctcac 3240
tgtcagccag gctggagtgc agtggcacaa tctcggctca ctgcaacctc tgcctccgag 3300
attcaagcga ttctcc 3316


<210> 38 

<211> 4629 

<212> DNA 

<213> Homo sapien 












<400> 38 
gcgaattggg cctctagacg catgctctag caggcggcgc agtgtgctgg aaaggaccaa 60
tcgacctgag gacttgcgag ccgctcagct cccgggacgt ttggagctgc tgctaaataa 120
tttctgctca gccatgtcgc cggctccaga tgcagccccg gctcctgcgt cgatctccct 180
gtttgacctc agcgcggatg ctccggtctt tcagggcctg agcctggtga gccacgcgcc 240
tggggaggct ctggcccggg ctccgcgtac ttcctgttca ggctcagggg agagagaaag 300
cccagaaaga aagctactcc agggtcctat ggatatttca gagaagttat tttgttcaac 360
ttgtgaccag accttccaga accaccaaga acagagggaa cattataagc ttgactggca 420
tcggtttaac ctaaagcaac gtctcaagga caagcctctc ctgtctgccc tggactttga 480
aaagcagagc tccacaggtg atgagtggta gggggaccta tgtaggagta ggacatggag 540
gtataagaaa gcactagttt agagtttgag gaaagaacaa aaatggggta ccaaacttgt 600
gtatctcata ttgtctgtct tcaggagatc tttccagcat ctcgggatca gaagactcag 660
actcagccag tgaggaggac ttgcagacac tggatcggga gagggctaca tttgagaagt 720
tgagccgacc cccaggcttt taccctcatc gagttctttt ccagaatgcc cagggccagt 780
ttctttatgc ctaccgctgt gtcctaggcc ctcatcagag gcaagtgaca gtacaggttg 840
catggctaac cccagccttt tgtacaccca gcctagattt cccttagcct agtgcatttc 900
tccctgcttc tcttccttcc tgttggttgt gggtatcacg cttgacaaca tagtcttgaa 960
ttgaaactgg cctgaattgt ggaagggtta agtgtaatgg aattaagttt ggaaacaaac 1020






caatttaggt ttgaattgtg attctaccac ttcatcttga ctgcctcgcc aagcctcaaa 1080
tttccttgtt tgtcaaatgg aaacaagcct cagggtggta atattaaatg agataatgtt 1140
tgtgaaaatt ggcccttagt ttcttgctct cagtgaatgc aacagatatg tttctgatgc 1200
ttctctgtca tcaaggatcc cccagaagag gcagaactgc tgctacagaa cctgcaaagt 1260
agaggtccca gagactgcgt ggtgctcatg gctgcagctg ggcactttgc tggtgctata 1320
tttcaagggt gagagggtgc tgcatggggg taaggatgga gtggatcctg ttagccaggg 1380
agcaccagct ggtctccagt actgagtctg tgctgtctac agaagagaag tggtgacaca 1440
caaaactttt caccgctata cggttcgggc caagcggggc acagcccagg ggcttcggga 1500
tgcccgaggt gggccatcac actctgctgg agccaacctg aggcgctaca atgaagccac 1560
actatataag gtgagttaag ccttttagat ctggtggtac tgatccatac tttttttgca 1620
tctctgttca actcactgtt aagtagggag gtttttgact catatccttc caattttcat 1680
tataggtaga caaaggtaga tgagggattc cccaggagcc ttagtcctct attctttctc 1740
cgtgttatcc tctactttcc accccttgca cttctgtggg ccattttttt tagttggaga 1800
tttataaata gcaagtgaag gtatgcagga ctgagttact ctcttctcta ggatgttcgt 1860
gacctgctgg cagggccaag ctgggctaag gcgctggagg aggctggtac aatactgttg 1920
cgtgctcccc gctctggccg gtctttgttc tttggaggca agggagcacc cctgcaaagg 1980
ggggatcccc gactttggga tatccccctc gccacccgca gacccacctt ccaagagcta 2040
cagcgtgtgc tccataagct gaccactttg catgtctatg aagaagaccc tcgggaagca 2100
gtcagactgc actcacctca gacacactgg aaaacagtaa gagaggagag aaagaagcct 2160
actgaggaag aaataagaaa gatctgcagg gatgaaaagg aagcgctggg gcagaatgag 2220
gaatctccca aacagggttt gattactatc tggcaactgt cagatctgag tttctgtcct 2280
aaaaatgcac tggcaaattc cctactctca taactgaccc atttccttat ttaaacacac 2340
acccacagtg tctatcttag caattactga gacatgttta gtgacctttc caaatctatt 2400
atctcttttg ttatggatgt ttccccagag ctaagaaaat ctttgtgtgg agcagatgcg 2460
tggatgtgtt gcatggatct cacatagttt atctttgatt ggatgctgtg gtttgaggct 2520
gagggctgct tccatgcttt cctctattgt gcggcaccct aaggctactc atacggagag 2580
tccagggact tgttcttgtc tgggctgaga gcctcatttc tgcatgtgat ataatggact 2640
cagctcttga tgtgctactt ttatctgggt gacaaaataa aaacaatttc atttcaggct 2700
ttcattctta tgcatctgct tctaccctct gagtactccc tagatggatt tcacatgtca 2760
ggtttttccc taggttcagg gtcggaggga gaagatggct ttcaggtaga gttggagcta 2820
gtggagttga ctgtggggac tctggatctt tgtgagtctg aagtattgcc caagcggagg 2880
aggagaaaaa ggaataagaa ggagaaaagc cgagaccagg aggctggggc acatcggact 2940

cttctccagc aaactcaaga agaggagcct tccacacagt catcccaggc agttgctgcc 3000 

cccttgggcc ctttgctgga tgaggccaaa gcccctggtc agccagagct ctggaatgca 3060 

ctgcttgctg cttgccgagc tggagatgtt ggagtgctaa agctgcagct agctcccagc 3120 

cctgcagacc ctagagttct gtctctgctc agtgccccct tgggctccgg tggctttact 3180 

ctcctgcatg cagcagctgc agctggaaga ggctcagtgg ttcgtctgct gctggaagca 3240 

ggtgctgacc ccactgtgca gtgagtaaag gtccccatcc tgagtcattt tgggtataga 3300 

gaggataatt tggaggttaa agttggcatt ggctacttgg caatattccc ttggtctggt 3360 

tgggtgatgt tgggaaaggc aggaggcaag ttaaacccat ttttggagtt tttgcaccta 3420 

gggactctcg ggcccggcca ccttatactg ttgcggctga caaatcaaca cgtaatgagt 3480 

tccgaaggtt catggagaag aatccagatg cctacgatta caacaaggct caggtcatct 3540

ggaataggaa ggacaagcag agtgggaagg catcacagat cctggctaga tcctttctac 3600 

ttcttgcagt tttaccaaag ataacttata gggtcttata tttgaaatcc tccaggtgcc 3660 

aggaccattg acaccagaaa tggaggcacg gcaggctaca cggaaaaggg agcagaaggc 3720 

agcccggcgg caacgggagg aacagcagca gaggcagcag gagcaggagg agcgtgaacg 3780 

agaagagcag cggcgatttg ccgccctcag tgaccgagag aagagagctc tggctgcaga 3840 

gcgccgactc gctgcccagt tgggagcccc tacctctcca atccctgact ctgcaatcgt 3900 

caatactcga cgctgctgga gttgtggggc atccctccaa ggcctgactc cctttcacta 3960 

cctcgacttc tctttctgct ccacacgttg cctccaggat catcgccgtc aggcagggag 4020 

gccctcttcc tgatctctta cagctctacc tggggccaac tcagggacct gagagggcac 4080

attcacagca gccctaggtt ttttcttccc cgtgaaacca gagatgattt ggaagatggg 4140 
ggtgaaggac actcgggaac tagggcaaag acagggctag aggtatgtgg agctggtact 4200 
gtctctggaa ttttaatcac aataaagttt ggcaaggaat gtgtacttgt acttacattc 4260 
agaggcactg tggcctttag ttccttaggg tcaccgtggc cagcaccaag ggatcctgcc 4320
cactccatgt cccaggtcca agggttactt tttttttttt tttttttttt tgagatggag 4380 
tgtctcactg tcagccaggc tggagtgcag tggtacaacc tccactcact gcaacctctg 4440 
cctcctggat tcaagcaatt ctcctgccct cagcctccca agtagctggg attagaggtg 4500 
cccaccatca cgcccggcta atttttgtat ttttttgaga tggagtgtct cactgtcagc 4560 
caggctggag tgcagtggca caatctcggc tcactgcaac ctctgcctcc gagattcaag 4620 
cgattctcc 4629

<210> 39
<211> 4090 
<212> DNA 
<213> Homo sapien 




<400> 39 


gcgaattggg cctctagacg catgctctag caggcggcgc agtgtgctgg aaaggaccaa 60
tcgacctgag gacttgcgag ccgctcagct cccgggacgt ttggagctgc tgctaaataa 120
tttctgctca gccatgtcgc cggctccaga tgcagccccg gctcctgcgt cgatctccct 180
gtttgacctc agcgcggatg ctccggtctt tcagggcctg agcctggtga gccacgcgcc 240
tggggaggct ctggcccggg ctccgcgtac ttcctgttca ggctcagggg agagagaaag 300
cccagaaaga aagctactcc agggtcctat ggatatttca gagaagttat tttgttcaac 360
ttgtgaccag accttccaga accaccaaga acagagggaa cattataagc ttgactggca 420
tcggtttaac ctaaagcaac gtctcaagga caagcctctc ctgtctgccc tggactttga 480
aaagcagagc tccacaggag atctttccag catctcggga tcagaagact cagactcagc 540
cagtgaggag gacttgcaga cactggatcg ggagagggct acatttgaga agttgagccg 600
acccccaggc ttttaccctc atcgagttct tttccagaat gcccagggcc agtttcttta 660
tgcctaccgc tgtgtcctag gccctcatca ggatccccca gaagaggcag aactgctgct 720
acagaacctg caaagtagag gtcccagaga ctgcgtggtg ctcatggctg cagctgggca 780
ctttgctggt gctatatttc aaggaagaga agtggtgaca cacaaaactt ttcaccgcta 840
tacggttcgg gccaagcggg gcacagccca ggggcttcgg gatgcccgag gtgggccatc 900
acactctgct ggagccaacc tgaggcgcta caatgaagcc acactatata aggatgttcg 960
tgacctgctg gcagggccaa gctgggctaa ggcgctggag gaggctggta caatactgtt 1020
gcgtgctccc cgctctggcc ggtctttgtt ctttggaggc aagggagcac ccctgcaaag 1080
gggggatccc cgactttggg atatccccct cgccacccgc agacccacct tccaagagct 1140
acagcgtgtg ctccataagc tgaccacttt gcatgtctat gaagaagacc ctcgggaagc 1200
agtcagactg cactcacctc agacacactg gaaaacagta agagaggaga gaaagaagcc 1260
tactgaggaa gaaataagaa agatctgcag ggatgaaaag gaagcgctgg ggcagaatga 1320
ggaatctccc aaacagggtt tgattactat ctggcaactg tcagatctga gtttctgtcc 1380
taaaaatgca ctggcaaatt ccctactctc ataactgacc catttcctta tttaaacaca 1440
cacccacagt gtctatctta gcaattactg agacatgttt agtgaccttt ccaaatctat 1500
tatctctttt gttatggatg tttccccaga gctaagaaaa tctttgtgtg gagcagatgc 1560
gtggatgtgt tgcatggatc tcacatagtt tatctttgat tggatgctgt ggtttgaggc 1620
tgagggctgc ttccatgctt tcctctattg tgcggcaccc taaggctact catacggaga 1680

gtccagggac ttgttcttgt ctgggctgag agcctcattt ctgcatgtga tataatggac 1740 

tcagctcttg atgtgctact tttatctggg tgacaaaata aaaacaattt catttcaggc 1800 

tttcattctt atgcatctgc ttctaccctc tgagtactcc ctagatggat ttcacatgtc 1860 

aggtttttcc ctaggttcag ggtcggaggg agaagatggc tttcaggtag agttggagct 1920 

agtggagttg actgtgggga ctctggatct ttgtgagtct gaagtattgc ccaagcggag 1980 

gaggagaaaa aggaataaga aggagaaaag ccgagaccag gaggctgggg cacatcggac 2040

tcttctccag caaactcaag aagaggagcc ttccacacag tcatcccagg cagttgctgc 2100 

ccccttgggc cctttgctgg atgaggccaa agcccctggt cagccagagc tctggaatgc 2160 

actgcttgct gcttgccgag ctggagatgt tggagtgcta aagctgcagc tagctcccag 2220 

ccctgcagac cctagagttc tgtctctgct cagtgccccc ttgggctccg gtggctttac 2280 

tctcctgcat gcagcagctg cagctggaag aggctcagtg gttcgtctgc tgctggaagc 2340 

aggtgctgac cccactgtgc agtgagtaaa ggtccccatc ctgagtcatt ttgggtatag 2400 

agaggataat ttggaggtta aagttggcat tggctacttg gcaatattcc cttggtctgg 2460 

ttgggtgatg ttgggaaagg caggaggcaa gttaaaccca tttttggagt ttttgcacct 2520 

agggactctc gggcccggcc accttatact gttgcggctg acaaatcaac acgtaatgag 2580 

ttccgaaggt tcatggagaa gaatccagat gcctacgatt acaacaaggc tcaggtcatc 2640 

tggaatagga aggacaagca gagtgggaag gcatcacaga tcctggctag atcctttcta 2700 

cttcttgcag ttttaccaaa gataacttat agggtcttat atttgaaatc ctccaggtgc 2760 

caggaccatt gacaccagaa atggaggcac ggcaggctac acggaaaagg gagcagaagg 2820

cagcccggcg gcaacgggag gaacagcagc agaggcagca ggagcaggag gagcgtgaac 2880 

gagaagagca gcggcgattt gccgccctca gtgaccgaga gaaggtgagg ctggaggttc 2940 

tcttgtccat ggcaaggctt cctagaggtc taaccccttc cccagcgtgc agctgtcacc 3000 

tcttgggtgc cctactcgcc actggggcct tgtccttaac acaacttgtc tccctcagag 3060

agctctggct gcagagcgcc gactcgctgc ccagttggga gcccctacct ctccaatccc 3120 

tgactctgca atcgtcaata ctcggtatgg ggtgcgggat gagggagtgg gtggactgta 3180 

atgttgcagg gaccattggg ggcaagagaa atgagtacct gcacagtatt cagaagggtg 3240 

gttgagggaa tttgggatgt tctggcagat gctgcagtga aaacagaaat ggcaggagga 3300 

gggacgggga gagcgtggat tggaaagttt ctggggtacc tgtccagcca tttttcttcc 3360 

tcttgttcag acgctgctgg agttgtgggg catccctcca aggcctgact ccctttcact 3420 
acctcgactt ctctttctgc tccacacgtt gcctccagga tcatcgccgt caggcaggga 3480
ggccctcttc ctgatctctt acagctctac ctggggccaa ctcagggacc tgagagggca 3540
cattcacagc agccctaggt tttttcttcc ccgtgaaacc agagatgatt tggaagatgg 3600
gggtgaagga cactcgggaa ctagggcaaa gacagggcta gaggtatgtg gagctggtac 3660
tgtctctgga attttaatca caataaagtt tggcaaggaa tgtgtacttg tacttacatt 3720
cagaggcact gtggccttta gttccttagg gtcaccgtgg ccagcaccaa gggatcctgc 3780
ccactccatg tcccaggtcc aagggttact tttttttttt tttttttttt ttgagatgga 3840
gtgtctcact gtcagccagg ctggagtgca gtggtacaac ctccactcac tgcaacctct 3900
gcctcctgga ttcaagcaat tctcctgccc tcagcctccc aagtagctgg gattagaggt 3960
gcccaccatc acgcccggct aatttttgta tttttttgag atggagtgtc tcactgtcag 4020
ccaggctgga gtgcagtggc acaatctcgg ctcactgcaa cctctgcctc cgagattcaa 4080
gcgattctcc 4090


<210> 40 
<211> 3770 
<212> DNA 
<213> Homo sapien 












<400> 40 
ctacttgcgg ggctgagatg ggtggatcgc ttgaacacag gaggttaagg ctgcagtgag 60
ccgagatcgt gccactgcac actagcctgg gcggcgggag tgagaccgtg tctcaaaaac 120
aacaacaaca acaacaacaa caaaaaacaa aaaatggacc gaggttgggg tggcaaggct 180
gggctggaat ggaggctgag gacaggccct ggcagcgccc agggggtaga gggcgtgcag 240
caggaagcaa gggccaccct cagggatggg ggactgggcc tggcagctgc cgaggggtag 300
gcaggatgca tgtggagagc cacgtggggc tgcggctgag ccaaggattg tgagaggaat 360
cctgctgctc tgggtggcag gcagaaggcc tgggccacag gtggggcctg ctggggagac 420
cacaggtgag gcaggtatgg ctgaccctgg gcgctcggca caccggtact cctcctgcct 480
ggctgctccc tcagcaggtg tcagagtctg agtcaccttt tcctcacgtg acccccacca 540
ggaccaggca gggaggtggc catctttggc cctgactcag ccctgcaggg agaagagggg 600
ctggaggtgc aggtggcctg ccactgtctc ccccaacccc tgcccacctg ggtcttcrtqc 660
tgtgtgtact tctgctgcgt ctgtcctccc ttgtgaccag cttctgctgg acctgattgt 720
ccagccatga tcttctgatc atcatggttt ctggacacac atctgttgga ctaaggaatt 780
tgtgttcctg gccacagaac agtgccacct ctgcctcaag cagttcctgg tctgggcact 840
cccccaggga ggccggagag gactgagccg tctcctgatg cagaaagtca gccccgctgg 900
gcagtgctcg cccaggaagt agggccccac ccaacggtgt gctgactgtg gaacactgtg 960
tctgcctagg cgggtctggc cacattcctg cagcatgatg gttggcagtt agcttcctgg 1020

agtgtggacg ggtccttccc tgcctgactg acttgtccac agggcagcaa gaagatgcgc 1080 

cccttggtgt gtcccgagtg tctcctgtag gcagctgagc tttcgcaggt ctggctggga 1140 

gccaaggtta ccaatctggg gggtaaccca gccatcatct ggatgagcgt tggggtcttt 1200 

ctgctctgga ttgcagggca gacgtcgttt tgtgctggtc aggcccctcg cctgcctgat 1260 

tcccttggag ccccccaaga caggagatgg gcagatgtcc tctgaagtgg catgggctgc 1320 

ttgctcactg gccttcccag gaccctgccc aggtggtgct gcttggcccc aggtcgaatt 1380 

cctgtgctgg cagcaagcag gggcctgggc cgtcggctgg ctggggctca tgcagccatc 1440 

ccctttgcag ccatccgggt gacctgcata ggttcctgcg gggtctccaa caaggctaat 1500 

gacacagcgt gggtagtgga ggagggttac ttcaacagtt ccctgtcgct ggcagacaag 1560 

ggtaagtttg ggagccagtt cccgagtggt gacccctggg ggcagccctt ggagtggggc 1620 

ctgtctgtcc tgtcctcccc ctttcctaga tctgcctctt ggatctggca ctggcccatc 1680 

ttgtcccagg gttgtgggcc ctgatggctt tctcataggc gctgggtggc tgaaggggcc 1740

acctccccct tgtgctggat ttgggtgggc tctgacccct ggcacagctg tttcaccctt 1800 

gccccattcc caaaccactc tcctgttcaa gccaagctct tatcctcagt ccttcaccca 1860 

agccacagcc ctgtgctgct gtggcgctgg tgactgagac agggagaccc agtgtgagct 1920 

ggtcagctgc agtgcccact cttatcctgg ccctgtggcc acctcaggcc acctcagggg 1980 

gggactgccc cccacctccc tcctgtctct gcagggagcc tgcccgctgg agagcacagc 2040 

ttccccttcc agttcctgct tcctggtgag agcccagcct ggacaggcct ggtgatgacc 2100 

aactggtcct gggaggtggg ctctgcaggg tggaaggcag ccaggtacca gtgcctgctc 2160 

tctccccagc cactgcaccc acgtcctttg agggtccttt cgggaagatc gtgcaccagg 2220 

tgagggccgc catccacacg ccacggtttt ccaaggatca caagtgcagc ctcgtgttct 2280 

atatcttgag cccccacttc ttggaccccg tcttcctctc caccaagagc cattcgcagc 2340 
ggcagcccct gctggccacc ttgagttctg tgcctggtgc gccggagccc tgccctcagg 2400

atggcagccc tgcctcacac ccgctgcacc ctcccttgtg catttcaaca ggtgccactg 2460 

tcccctactt tgcagagggc tccggggggc cagtgcccac taccagcacc ttgattcttc 2520 

ctccagagta cagttcttgg ggctacccct atggtgagtc gacagccagg gcttggcagg 2580 

gaggggacgc caagagcccc acgcagaccc tgctttcttc ccgcagaggc cccaccgtct 2640 

tatgagcaga gctgcggcgg cgtggaaccc agcctgaccc ctgagagctg accccgtgct 2700 

gccttctcca ggcaggcctg gcctctgccc tgggactggg gcgcccaggg cctcgtgcct 2760 
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tctctcttgg cctagcctgg cccactcagg acctgcccag cctctgccag ctcctctggc 2820 

atccgccctc ttctccctgg ggctggggtg ggggtggcag ggagctggga cctggagaga 2880 

caactcctgt aaataaaaca ctttatttgt agagctgggc ctcaccggcc tcgccttggg 2940 

caccctccag cctctggcag ggcctgaccc tgaccccatc ctgggctctg ctcatgccag 3000 

ggcagaccct gctcctcagg tgccccctac taccctgggg gcgcgctcag gctaccatgc 3060

ctgggcggag ccgcggggct ggctcccttt aacagaggac gggggctggc gctgggcctt 3120 

ccagagccgc ctaaggacac gtccttggag cccggctcat gaaatggggc ctcacgcaac 3180 

cttggtgttt ggttaaattt gtgccttggc tcctgttgcg tttaggcacc caggttccca 3240 

agcggtgtgg tgtggacctc ccctgctgag aaacccgctg gcgcgtggag gggcttgccc 3300 

aggtccctcc gacaggagac cctgctccct gggccttggc ccctgctagc accactgcca 3360 

ggcctggctc tgagggacgc tgcttctccc ctgcccacgt ctccaaggcc tgaaaggcag 3420 

ggttctgtgg aacagcattt ctcaaagggg catcctgcag acgttccagc cccaggggag 3480 

gccccgtgat gctggctggc ctgcctggag ggtgcttcca gaaacaacgg agatggtggt 3540 

gaagaaggct ctgggttcaa attccagttc tcctttaggg cgaaggggtg actgtggcta 3600 

ggtttgtcag ttaccttatc tataaaatgc acaggaccag gccgggcacg gtggctccct 3660 

cctgtaatcc cagtactttg ggaggctgag gcaggtggat tgcttgaggt caggagttcc 3720

agatcagcct gggcaacata gtgaaactcc atctgtacaa aaaaaaaaaa 3770 

<210> 41 

<211> 1650 

<212> DNA 

<213> Homo sapien 

<400> 41 

agggaccgta tttcaagtcc tgccctccca ccagcgcctg cgccccgcag cagtgtccgg 60 

ctccctgccg acgccctgga gctgggacgg cctggggcgc tggtcggccg ccttccccca 120

gaaggaggcg tgagcggggc gtggccggaa cgtctctctt cggagccgcg ccccgggcgg 180 

gcacagttac gcgccgcacg gatcatggcc gcagccgctc tggggcaggt gggttcgcag 240 

ccccgcgccg gggtctggga aataaccggg ggaggcgcac ttccgggaag accactggcc 300 

tgcagccggt ccgactcggc gccgctcctc ccgccctggg gttccgctgc agggcgcgga 360 

ggaggatggc gccctttgga aaggagcacg aggctttaat ggggtgcagc tgtttgaggg 420

catgaaggcg ttcaaaggca aagaccagca ggtgcgcctc ttccgcccct ggctcaacat 480 

ggaccggatg ctgcgctcag ccatgcgcct gtgcctgccg agtttcgaca agctggagtt 540 

gctggagtgc atccgccggc tcatcgaagt ggacaaggac tgggtccccg atgccgccgg 600
caccagcctc tatgtgcggc ctgtgctcat tgggaacgag ccctcgctgg gtgtcagcca 660
gcccacgcgc gcgctcctgt tcgtcattct ctgcccagtg ggtgcctact tccctggagg 720
ctccgtgacc ccggtctccc tcctggccga cccagccttc atccgggcct gggtgggcgg 780
ggtcggcaac tacaagttag gtgggaatta tgggcccacc gtgttagtgc aacaggaggc 840
actcaagcgg ggctgtgaac aggtcctctg gctgtatggg cccgaccacc agctcaccga 900
ggtgggaacc atgaacatct ttgtctactg gacccacgaa gatggggtgc tggagctggt 960
gacgcccccg ctgaatggtg ttatcctgcc tggagtggtc agacagagtc tactggacat 1020
ggctcagacc tggggtgagt tccgggtggt ggagcgcacg atcaccatga agcagttgct 1080
gcgggccctg gaggagggcc gcgtgcggga agtctttggc tcgggcaccg cttgccaggt 1140
ctgcccagtg caccgaatcc tgtacaaaga caggaacctc cacattccca ccatggaaaa 1200
tgggcctgag ctgatcctcc gcttccagaa ggagctgaag gagatccagt acggaatcag 1260
agcccacgag tggatgttcc cggtgtgaag ctgcaggctg tgctccagat ccaccgaccc 1320
gtagcatctc gtcacgccag cactcgcctc cctaccaatg actcacctga agtgcaatac 1380
gaaataaaag gccagcgggc ggcgtctggg tctctggcgc ccccatgtgg ttgcgacact 1440
cccaaagccg taagggccga cccaggcatc ttggccccca gcccctcgtc gcgggttcag 1500
gtccgcccat tactccttgt cgtgcggtca aggatacacc ttggccccga ttccggatct 1560
ctccgttctc aggccagacc cctggtgctg ccgttgattt ttttttctct gtctttgctg 1620
caattttgaa ataaaatgcc aaagaacaca 1650


<210> 42 

<211> 2040 

<212> DNA 

<213> Homo sapien 












<400> 42 
tccaaattgt ttatttatat atacaaagaa ttatcagggt tttccatgga gtaaatgccc 60
cggataatcc tctgaaggaa gagcatttag tccaacttaa tgaaaccgat atccttcgcg 120
tactggacgg aaacactggc ggcacatatg gaggccatat tcccggatca gaccgtgccg 180
gtttgaacag acacgacaag agcgagaacc cgggccgaat ttacgcgggt ggctccagta 240
cagctgctgg tgacccatct ttgctctcag ccgcacggat catggccgca gccgctctgg 300
ggcagatctg ggcacgaaag cttctctctg tcccttggct tctgtgtggt cccagaagat 360
atgcctcctc cagtttcaag gctgcagacc tgcagctgga aatgacacag aagcctcata 420
agaagcctgg ccccggcgag cccctggtgt ttgggaagac atttaccgac cacatgctga 480
tggtggaatg gaatgacaag ggctggggcc agccccgaat ccagcccttc cagaacctca 540
cgctgcaccc agcctcctcc agcctccact actccctgca gctgtttgag ggcatgaagg 600

cgttcaaagg caaagaccag caggtgcgcc tcttccgccc ctggctcaac atggaccgga 660 

tgctgcgctc agccatgcgc ctgtgcctgc cgagtttcga caagctggag ttgctggagt 720 

gcatccgccg gctcatcgaa gtggacaagg actgggtccc cgatgccgcc ggcaccagcc 780

tctatgtgcg gcctgtgctc attgggaacg agccctcgct gggtgtcagc cagcccacgc 840 

gcgcgctcct gttcgtcatt ctctgcccag tgggtgccta cttccctgga ggctccgtga 900 

ccccggtctc cctcctggcc gacccagcct tcatccgggc ctgggtgggc ggggtcggca 960 

actacaagtt aggtgggaat tatgggccca ccgtgttagt gcaacaggag gcactcaagc 1020 

ggggctgtga acaggtcctc tggctgtatg ggcccgacca ccagctcacc gaggtgggaa 1080 

ccatgaacat ctttgtctac tggacccacg aagatggggt gctggagctg gtgacgcccc 1140

cgctgaatgg tgttatcctg cctggagtgg tcagacagag tctactggac atggctcaga 1200 

cctgggtgag gacatggcat cttctggtga tgggcgccat gtgccagggg ccagggcatc 1260 

agagggctgg gactggggca cactggcatg tctctgcccc ttctcctggg tctgtctctc 1320 

ccgttgggcc tctgtctttc tcactatcct cagggaggga gaggtggagg tctgcagcag 1380 

ggcaaccctc tggggacacg tgtctctgcc aactgccctg cagggtgagt tccgggtggt 1440 

ggagcgcacg atcaccatga agcagttgct gcgggccctg gaggagggcc gcgtgcggga 1500 

agtctttggc tcgggcaccg cttgccaggt ctgcccagtg caccgaatcc tgtacaaaga 1560 

caggaacctc cacattccca ccatggaaaa tgggcctgag ctgatcctcc gcttccagaa 1620

ggagctgaag gagatccagt acggaatcag agcccacgag tggatgttcc cggtgtgaag 1680 

ctgcaggctg tgctccagat ccaccgaccc gtagcatctc gtcacgccag cactcgcctc 1740

cctaccaatg actcacctga agtgcaatac gaaataaaag gccagcgggc ggcgtctggg 1800 

tctctggcgc ccccatgtgg ttgcgacact cccaaagccg taagggccga cccaggcatc 1860 

ttggccccca gcccctcgtc gcgggttcag gtccgcccat tactccttgt cgtgcggtca 1920 

aggatacacc ttggccccga ttccggatct ctccgttctc aggccagacc cctggtgctg 1980 

ccgttgattt ttttttctct gtctttgctg caattttgaa ataaaatgcc aaagaacaca 2040 

<210> 43 
<211> 1191 
<212> DNA 
<213> Homo sapien 

<400> 43 

tactcgtgtg ttacagtgtg cccccagatg ttctcccaat tgcgtggcgt tcactgcagc 60 

tgtaacggtc cctgcgtgca tttatgcgct ctttggaccc tgtgaacaca ccgggatact 120 

ggtaatcttg ccccctatgg agtacttatg gcgctcccct gtgtttattt wtttkggtat 180

aaacccttwa taascawttt aacwyacaka awagttcawa gawwaagatc agaaagaata 240

ctgcttcata tgtcccaaaa gagaaaaaaa taaaggggac aatgccaaca tgctcaacaa 300

taaaggcttc tttttcttat ttttttaata caaaatacaa gcaaaggata cacatactta 360

aaacagagct caggagcaga cacgcagtcc tggaaaccct tcaataaaag caaagcagga 420

gtttgttttt tctttgtcta tgcagataca tacagagact gggatatgta aaaattaagt 480

atcacaaaag accatcacac gattctacca atgcatgttg catctgtaat tcacgaacat 540

ggtcaacaaa atcatgttca cttcaacccc atttcattta aattaaagaa aaaaaccttt 600

taaataaagt ggttacattc aaactttaac ttccttagta ccatgctgca gatttcagca 660

ctgttaaggt attgcaagaa tgcccaaccc tctggtgtct gatcatgtat ctagcaacat 720

tgcagtatga agaaaagaga tgccccggtc tcagcccatg gactagttaa tacagtgaag 780

caggttcctg tcttttaccc ttcctgctca gaacataaaa gattaaggac taaaatcaag 840

gaagactggg agttttagag ctggcaaaat gaagtctaaa agataaatca aggcaaacaa 900

ttactgagaa cttggctgtt gcttaacctg gcaagtctaa aagcctttct ttaaccttgt 960

aggaattaga tgcataaggt ttgctgcaac atgttcatgg taaacaaact aagtagagct 1020

cttatttaca aatcttgtaa caaatacttc tggaggaaaa agagaaaaga attcactaag 1080

ttccagaaga caagcttaat tgcagacgta tacaaacaca catcacacgt aacaccacac 1140

atattcgggt ttttatacat gttattttag ggcataagct gagtactata c 1191


<210> 44
<211> 3097
<212> DNA
<213> Homo sapien

<400> 44

gagcctacgt gacacgacgt gaagtggcaa tggcggcgga cgagggctgt gcccccggcc 60

cgccccgtta ggagccgcga gtaaggcgag tgtccaacgc cgagctggct gacagagccc 120

ggccgcggcc tccgcgggcc caggggcctc caggccccgt gaccaccgga ccgagcaccc 180

tggaacggcc ccagcttggc ctcgggacag tccgcgcgct caccgactcc ctggtgaacg 240

ccgcttggcc gccgccgccg ccgcaggacc cacgagaagc ggaaaccggc gcaaggaccc 300

ggtccccccg gcgccgtacg tggagtgagc ctgcggctcc gccccgcgcg ctccgattgg 360

ctctggggcc tggcccgccc cttcccgaca ctgtcattgg cctaggcaaa gccgtaaggg 420

tggggaatcc tattggccca ggggtgcggc ttgtcctctc ccggtgctct cattggccga 480

gcgcggccgt tgggggtgag gccgcgtcgg gggaggacaa caaagggccg cgggcggcgg 540

gcagtggtgt cccagtctcc cggtgcttcc ctgaggctga ggcgcccggc ctcccgcccg 600

ccgcgctcca gatgaagtgt gagcactgca cgcgcaagga atgtagtaag aaaacaaaaa 660

ctgatgacca agagaatgtg tcagccgatg caccgagtcc agcccaggaa aatggagaga 720

agggagaatt ccacaagttg gctgatgcca agatattttt gagcgactgc ctggcatgtg 780

acagctgtat gactgcagag gaaggagtcc aactttccca gcaaaatgcc aaggacttct 840

tccgcgttct gaaccttaac aagaaatgtg atacctcaaa gcacaaagtg ctggtagtgt 900

ctgtgtgtcc tcaatctttg ccttattttg ctgctaaatt caacctcagt gtaactgatg 960

catccagaag actctgtggt ttcctcaaaa gtcttggggt gcactatgta tttgatacga 1020

cgatagctgc ggattttagt atcctggaga gtcaaaaaga attcgtgcgt cgctatcgcc 1080

agcacagtga ggaggaacgc accctgccca tgctgacctc tgcctgtcct ggctgggtcc 1140

gatacgccga gcgggtgctg ggtcgcccca tcactgccca cctctgcacc gccaagtccc 1200

cccagcaggt catgggctct ttggtgaagg attatttcgc cagacagcag aacctgtctc 1260

cagagaagat tttccacgtc attgtggccc cttgttatga caagaagctg gaggctcttc 1320

aggaaagcct tccccctgct ttgcatggct cccggggcgc tgactgcgtg ttaacatcag 1380

gtgaaattgc tcaaataatg gagcaaggtg acctctcagt gagagatgct gccgtcgaca 1440

ctcttgtaag tggcttctct ggggagagtc ctctgggggg caggacctct cgtcagccct 1500

gccagccacc tgccaggccc agggctgccc tgctttatga cgaggccatg tgaagcactg 1560

cgatgcagtc ccccaggccc ctgccacaac cctcagccac cctatagctg ttgaatggga 1620

taccattcac cagggctgcc agggagttga aggtttggcg ggcttccatg gtgtgattgc 1680

caccctaaac ccttgaggaa ggacaaaatg cacaaacagc tttgtcctcg ctggcgcagg 1740

gctgttcatg tgtgcaccag cccccttctt cctgagtgcc gtctgtgcct cacattccgt 1800

gtggtctatg gggcagtggg gcagctccct caggcccctt tgcaccaaga gtggcagtct 1860

gggttctgct gggggcggtg acctcgatac ccagcaaggc ccagggctcc tgctgcaccc 1920

caggcactgg gcagagagca ccacaaaggc agcagcgtgt gtcctcgcca acagtccgca 1980

tcccaggtgc tgcattcgtg cccttttggg ggaaggtaag atagaagttt acagaattta 2040

agcatctgtg gggccaggcg tggtggctca cacctataat cacagtactt tgggaggctg 2100

aggtaggagg atcacttgag tccaggaggc tggggctgta gtgagctatg attgtgccac 2160

tgcactccag cctgagagac agaacaagac cctgtctcca aaagaaaaag accataagca 2220

tctggagcac ctgcaagagc ctgtgcatgg gctcggcgtg actgagagtc ctccgccttg 2280

cccgagcatc aggcaggccc tcccaagacc cctcaagatg ctagaatggg actttctgca 2340

aagtttaaac atctgtacag gatttagctt ctttggtggc gaacttggat tgtgatgccc 2400

tcaggaagtc tggtttcact ctttctgtca ctgtacgtca gcattcatga aggacttgta 2460

ttttctctgc cttgtgaggt ttggagactt gaaggaggac aaagtgacgc gtcatgatgg 2520 

agccagctca gacgggcacc tggcacacat cttcagacat gcggccaagg agctgttcaa 2580 

cgaggatgtg gaggagaaac aaagacttcc aagaggtcac ccttgagaag aacggagagg 2640 

tggtgttacg ctttgctgca gcctatggct ttcgaaacat ccagaacatg atcctgaagc 2700 

ttaagaaggg caagttccca ttccactttg tggaggtcct cgcctgtgct ggaggatgct 2760 

taaatggcag aggccaagcc cagactccag acggacatgc ggataaggcc ctgctgcggc 2820 

agatggaagg catttacgct gacatccctg tgcggcgtcc ggagtccagt gcacacgtgc 2880 

aggagctgta ccaggagtgg ctggagggga tcaactcccc caaggcccga gaggtgctgc 2940 

ataccacgta ccagagccag gagcgtggca cacacagcct ggacatcaag tggtgaagtc 3000 

aggccagggc cttccagctg ctcttggggc cagagccaag agcctctcag tagagggagg 3060

ggctgccctg agtggagtat taaagacact taagaaa 3097 

<210> 45 

<211> 3709 

<212> DNA 

<213> Homo sapien 

<400> 45 

gagcctacgt gacacgacgt gaagtggcaa tggcggcgga cgagggctgt gcccccggcc 60 

cgccccgtta ggagccgcga gtaaggcgag tgtccaacgc cgagctggct gacagagccc 120 

ggccgcggcc tccgcgggcc caggggcctc caggccccgt gaccaccgga ccgagcaccc 180 

tggaacggcc ccagcttggc ctcgggacag tccgcgcgct caccgactcc ctggtgaacg 240 

ccgcttggcc gccgccgccg ccgcaggacc cacgagaagc ggaaaccggc gcaaggaccc 300 

ggtccccccg gcgccgtacg tggagtgagc ctgcggctcc gccccgcgcg ctccgattgg 360

ctctggggcc tggcccgccc cttcccgaca ctgtcattgg cctaggcaaa gccgtaaggg 420

tggggaatcc tattggccca ggggtgcggc ttgtcctctc ccggtgctct cattggccga 480 

gcgcggccgt tgggggtgag gccgcgtcgg gggaggacaa caaagggccg cgggcggcgg 540 

gcagtggtgt cccagtctcc cggtgcttcc ctgaggctga ggcgcccggc ctcccgcccg 600 

ccgcgctcca gatgaagtgt gagcactgca cgcgcaagga atgtagtaag aaaacaaaaa 660 

ctgatgacca agagaatgtg tcagccgatg caccgagtcc agcccaggaa aatggagaga 720 

agggagaatt ccacaagttg gctgatgcca agatattttt gagcgactgc ctggcatgtg 780 

acagctgtat gactgcagag gaaggagtcc aactttccca gcaaaatgcc aaggacttct 840 

tccgcgttct gaaccttaac aagaaatgtg atacctcaaa gcacaaagtg ctggtagtgt 900 

ctgtgtgtcc tcaatctttg ccttattttg ctgctaaatt caacctcagt gtaactgatg 960

catccagaag actctgtggt ttcctcaaaa gtcttggggt gcactatgta tttgatacga 1020 

cgatagctgc ggattttagt atcctggaga gtcaaaaaga attcgtgcgt cgctatcgcc 1080 

agcacagtga ggaggaacgc accctgccca tgctgacctc tgcctgtcct ggctgggtcc 1140 

gatacgccga gcgggtgctg ggtcgcccca tcactgccca cctctgcacc gccaagtccc 1200 

cccagcaggt catgggctct ttggtgaagg attatttcgc cagacagcag gtaagctgac 1260 

ctctctggag ggcaggaagg gcaggtgagg gaggaccatg gcacaggggc ccaggaaggg 1320 

gaggttcccc atctgtccag cgtggtcacg gccccccagg tgagcaaaaa ggaagctgtg 1380 

gcttcaggaa acatcagcag tgctgtttct caccaacgcc cgtctctatt cctcccttcc 1440

tcttacaacc tttttctttc ttcttaacct tttctgtggc atctggtggt tttcctgggc 1500 

ccattcgggg tagagtagaa atgctgccca ccctgctgag aatgaggccc cggaccttgc 1560 

ccagagatgc tgcctgcctg ctgttgggat gcctcaccca aggaggccta aagtggcctg 1620 

tgcctggagg ctgagggcag ctgtcattta gggagccaaa cctggcctct gccctggcct 1680 

ttccctcaga ccacgggcat cacatgactt ccttagccac ttctgtgaga ttccacagtg 1740 

gccatgatgc tgtcctcctc ttgatggagt gagtcccggc ctgaggctcc tgcgtgcccc 1800 

ctgcctcggc ccgtcgtgtt ctcccttccc acaaccctct gcggccccca ggtcctccca 1860 

gaactccctg atgtctccaa ggccaccctc aaagggaccc taatcccatg gtttattctc 1920 

cagccgctgg catgccccac actgctcttc tcaatcagac tttacacctc cctccgggga 1980 

agcagccatg tccaggcact gtggtaactg gagtagaggg taacatcagg atctttcaga 2040 

catttgaagt cagcctaaaa aggcacgcac atttggcagc cttgttattt tagcaatagg 2100 

cagtgttttc cttctatcta gtttctttct acccataaag tcataatttg gcaaattata 2160 

aacatgatgt cagagtatac acgttcagtt taggcaggac cgaaaaatga gctatgggac 2220 

tttctgtccc ttgccctgcc cccacctgaa gagagccatg tgtttctagg tcttccacgt 2280 

gctgccctct cccctggatg gagcccagag ctatgtgtcc tcctgtaggt gccatgaggg 2340 

gcagccgcag agacccggcc tccagcagcc tctcacctga ccacctggca cagctgcaac 2400 

caggggcggg tttcctcttt tgctccgccc ctgccctgag tgggtccttt ccacccagac 2460 

ctcaccacag ggtcacttgg caactgctgg ctcctggctt tgccaccagt tcaggcccag 2520 

ccacaggcac ctgctctgtg tggacccagg agaggggtag cttgggtggg catcgcttgg 2580 

gaagattcag gaaagtctgt gggtgaagta gccagcgcgc acacacacac acacacacac 2640 

acactcactc aggcctcctg caatggctcc atgctggacc agcaacagct ccacgccggc 2700 

tggacttcct cttacttgtt ttgtccccat gtgcacaggt tcaggaggaa atgggctcag 2760

ccttcccggt tcttccagga ggctccactg gctggggggc tttgcagatg tttgggagga 2820 

ccacacccag tcctgaggga ggcagcagac agacatggat ggagtgctgg tgctcgaacc 2880 

tgtctccaga gaagattttc cacgtcattg tggccccttg ttatgacaag aagctggagg 2940 

ctcttcagga aagccttccc cctgctttgc atggctcccg gggcgctgac tgcgtgttaa 3000 

catcaggtga aattgctcaa ataatggagc aaggtgacct ctcagtgaga gatgctgccg 3060

tcgacactct gtttggagac ttgaaggagg acaaagtgac gcgtcatgat ggagccagct 3120 

cagacgggca cctggcacac atcttcagac atgcggccaa ggagctgttc aacgaggatg 3180 

tggaggaggt cacttaccga gccctgagaa acaaagactt ccaagaggtc acccttgaga 3240 

agaacggaga ggtggtgtta cgctttgctg cagcctatgg ctttcgaaac atccagaaca 3300

tgatcctgaa gcttaagaag ggcaagttcc cattccactt tgtggaggtc ctcgcctgtg 3360 

ctggaggatg cttaaatggc agaggccaag cccagactcc agacggacat gcggataagg 3420 

ccctgctgcg gcagatggaa ggcatttacg ctgacatccc tgtgcggcgt ccggagtcca 3480 

gtgcacacgt gcaggagctg taccaggagt ggctggaggg gatcaactcc cccaaggccc 3540 

gagaggtgct gcataccacg taccagagcc aggagcgtgg cacacacagc ctggacatca 3600 

agtggtgaag tcaggccagg gccttccagc tgctcttggg gccagagcca agagcctctc 3660

agtagaggga ggggctgccc tgagtggagt attaaagaca cttaagaaa 3709 

<210> 46 

<211> 1840 

<212> DNA 

<213> Homo sapien 

<400> 46 

cgcattctcc tctctgctaa gctttctcat gctagagaat gatgactaaa aaacagatga 60 

tcccatcttc ttattcacca tcccctaaaa aaaacagact gccacgtgga atgtgtcagc 120

ctcacctggg ggagacaccc ttatctgacg acttaaaagg aaaaacttgg tgagccagcc 180 

tccacacaaa cacacgcata gccttgcaca aaacaaaaat ttattgggag aaagatggct 240 

gatgggaggc aagatctgga ttcaggaatc ctcaatttta gctctcgctc tgtcaccaaa 300 

accctccgtc cctcccagag cctcgaggag tttccccttg tgtaaaccgg tatcgcgtcc 360

cttccagctc tgacatcctg cggttctaga atacccattg attgaaagag ctgtggccaa 420

acaggaagga cagagtaaga gccgctgaca aaggattctg ccaagacaga atcccaaggt 480 

cacaggagag aggcttgggg cagtgaaagc cctctcgggt cttggctttg gaagtgtgga 540 

ctgtaccagc agatgggccc ttgggaagcc aacagggaac aattcaaggc tggagagaaa 600

aggccacctt tgacccagca gaaggcagag gcgggagagg agggtgagga tgtacaagga 660

gcatcgcagg cgaggaaaca atggccagga cctaactgtg gtgggaactg cctttgtctc 720 

cacacactcg caatcaacat gcgtatttgc tattctcaaa caactccctt ccaccccctt 780 

aggctgaaag gacaaactca tcagagttga ggagtaccta cctgatgaag ctgttcatgc 840 

ttccagcatc aaactggacg tttactgtct gcccaaaagc cagtttccaa aaggtttgct 900 

tccctctgtt cagtggattc ttgactacat ataggtcata tatttcaaaa aataatgcct 960 

agctatttct actttgaaat catgactaaa gccaaaccac aaccacagca aagataacct 1020 

aaggatttgt ttaccagaaa tacctacaaa aaagtttgca ggccgggcgc gatggctcac 1080 

acttgtaatc ccagcacttt gagaggccga gatgggcaga tcacctgagg tcaggggttc 1140 

aagaccatcc tgcccaacat gatgaaacct ggtctctact aaaaatacaa aaattaaccg 1200

ggtgtggtgg cgcatgcctg taatcccagc tactcaggag actgaggcgg aattgccaga 1260 

accctggagg gcagaggttg cagtaagctg agatcgggcc actgcactcc agcctgggtg 1320 

acagaatgag actcagtctc ccaccctgcc aaaaagtttg caatgtcggc agcaaagtgc 1380

aaacagagcc ggctctgggt gaaagcttta gttatgaaaa ataagaaaaa aaaaatccct 1440

aagcatttct taacactagt tttaaagaaa accccctctc caaaggtatg tttctccttc 1500 

aactttcttg actttgccag gctgtcagga tgcaagctac ctactctttt ctggatggat 1560 

cgggacaaag tttttaaaca aagactgtgt cctctccaca agcccttccc accaccacca 1620

ccccagcccc ccgccgccat aattacaggg gctgtgaaat ggctgttgag tgatggtcac 1680 

acaaccagaa ggcagatgaa aaggtcaggc tctaagaggg gttcagcctc ccagtaccag 1740 

cctgctgtgc ccaggggtgg gagtgctggg agaactgtct tccccagaca ggcagctatg 1800 

ccgcctccca ctgcaaaggc accaaaggca acatccgtcc 1840 

<210> 47 

<211> 1618 

<212> DNA 

<213> Homo sapien 

<400> 47 

cgtcgacggg atggagggtt ttggtgacag agcgagagct aaaattgagg attcctgaat 60 

ccagatcttg cctcccatca gccatctttc tcccagagcc tcgaggagtt tccccttgtg 120 

taaaccggta tcgcgtccct tccagctctg acatcctgcg gttctagaat acccattgat 180 

tgaaagagct gtggccaaac aggaaggaca gagtaagagc cgctgacaaa ggattctgcc 240

aagacagaat cccaaggtca caggagagag gcttggggca gtgaaagccc tctcgggtct 300 

tggctttgga agtgtggact gtaccagcag atgggccctt gggaagccaa cagggaacaa 360 

ttcaaggctg gagagaaaag gccacctttg acccagcaga aggcagaggc gggagaggag 420

ggtgaggatg tacaaggagc atcgcaggcg aggaaacaat ggccaggacc taactgtggt 480 

gggaactgcc tttgtctcca cacactcgca atcaacatgc gtatttgcta ttctcaaaca 540 

actcccttcc acccccttag gctgaaagga caaactcatc agagttgagg agtacctacc 600 

tgatgaagct gttcatgctt ccagcatcaa actggacgtt tactgtctgc ccaaaagcca 660 

gtttccaaaa ggtttgcttc cctctgttca gtggattctt gactacatat aggtcatata 720

tttcaaaaaa taatgcctag ctatttctac tttgaaatca tgactaaagc caaaccacaa 780 

ccacagcaaa gataacctaa ggatttgttt accagaaata cctacaaaaa agtttgcagg 840

ccgggcgcga tggctcacac ttgtaatccc agcactttga gaggccgaga tgggcagatc 900 

acctgaggtc aggggttcaa gaccatcctg cccaacatga tgaaacctgg tctctactaa 960 

aaatacaaaa attaaccggg tgtggtggcg catgcctgta atcccagcta ctcaggagac 1020 

tgaggcggaa ttgccagaac cctggagggc agaggttgca gtaagctgag atcgggccac 1080 

tgcactccag cctgggtgac agaatgagac tcagtctccc accctgccaa aaagtttgca 1140 

atgtcggcag caaagtgcaa acagagccgg ctctgggtga aagctttagt tatgaaaaat 1200 

aagaaaaaaa aaatccctaa gcatttctta acactagttt taaagaaaac cccctctcca 1260 

aaggtatgtt tctccttcaa ctttcttgac tttgccaggc tgtcaggatg caagctacct 1320 

actcttttct ggatggatcg ggacaaagtt tttaaacaaa gactgtgtcc tctccacaag 1380

cccttcccac caccaccacc ccagcccccc gccgccataa ttacaggggc tgtgaaatgg 1440

ctgttgagtg atggtcacac aaccagaagg cagatgaaaa ggtcaggctc taagaggggt 1500 

tcagcctccc agtaccagcc tgctgtgccc aggggtggga gtgctgggag aactgtcttc 1560 

cccagacagg cagctatgcc gcctcccact gcaaaggcac caaaggcaac atccgtcc 1618 

<210> 48 

<211> 3252 

<212> DNA 

<213> Homo sapien 

<400> 48 

agggggctct gggcagacgt aggcgccggg ctgcaccttc cctgcccgcc tgtttgcact 60 

tctctgtgac tgttggcttc aggtccttag ccagcggagg gctgactccg gcctggctgc 120 

cccctcccct ttgcctcccg tccctccccc tggacactcc aacccccatc ccttcgtgtc 180 

cctccccctc ctctctctat gctactccct agccagaccc cccgctcttc cccggaacaa 240 

tggaggggaa tgtggctagg ccagcctgac gctgctgcgt caacctctgg ctcgttctcg 300 

gcctctatcg cttccaggga cccttctatc tgggctcagc tgacccgccc cgcctctcgg 360 

cccacgtgcc ggcagttctc ccaggctggg gctcagctgc gctggagact gggatgggtc 420

agctcagtgt ctctgtctgt ttacaaagga ggaacccagg tgcactaagc aggcccccca 480

gtcctgggtg caggcttgtg acagctgctc agggatccaa acgtggcgct tgagctctct 540

gcgatgtgtt ccacagttcc ctggaggcgg acactgcgtg agggtcagcc atgtcatctg 600

agcctccccc accaccacag ccccccaccc atcaagcttc agtcgggctg ctggacaccc 660

ctcggagccg tgagcgctca ccatcccctc tgcgcggcaa cgtggtccca agcccactgc 720

ccactcgccg gacgaggacc ttctcggcga cggtgcgggc ttcacagggc cccgtctaca 780

aaggagtctg caaatgcttc tgccggtcca agggccatgg cttcattacc ccagctgatg 840

gcggccccga catcttcctg cacatctctg atgtggaagg ggagtatgtc ccagtggaag 900 

gcgacgaggt cacctataaa atgtgctcca tcccacccaa gaatgagaag ctgcaggccg 960 

tggaggtcgt catcactcac ctggcaccag gcaccaagca tgagacctgg tctggacatg 1020 

tcatcagctc ctaggagatg gtggaagcac cccttgtcct gtgcttgtgg gagactttgc 1080 

ggggaggagg cagcagacac tggagatgac attcttccac acgagacggg gcttcagccg 1140

ggcatggtcc ctctcaagta tctcctggag gaaggggtat ggggggcagg tgtggggtgt 1200

ggggtgttcc cggccatcag cacagcctat gaccattgca acaacctctc accatctgaa 1260

gagcattaaa agcatttaaa aaggagaggt gcccactggt ggctgagtgg aggttccaac 1320 

cccatcccag ggagtggatc aagggtggta tttctccagc tgctcagaca catgggctca 1380 

acccacagaa tccctcttcc tcctggagct ggaggcccca gattcccaga tctggccccc 1440 

tggcagcctg acagggacct tgcgtgactt ctccaaggca aatttccacc taagtgcccc 1500 

ttgcgcctct cctggggcct gggcaaagca gttttctaat tcttggcttg gttggttcta 1560 

ggggagctgg cttgaagtgg gtggggaaag gcgggggtgg cggtctttgg attggacgga 1620

tgttgccttt tggtgccttt gcagtgggag gcggcatagc tgcctgtctg gggaagacag 1680

ttctcccagc actcccaccc ctgggcacag caggctggta ctgggaggct gaacccctct 1740

tagagcctga ccttttcatc tgccttctgg ttgtgtgacc atcactcaac agccatttca 1800

cagcccctgt aattatggcg gcggggggct ggggtggtgg tggtgggaag ggcttgtgga 1860

gaggacacag tctttgttta aaaactttgt cccgatccat ccagaaaaga gtaggtagct 1920 

tgcatcctga cagcctggca aagtcaagaa agttgaagga gaaacatacc tttggagagg 1980 

gggttttctt taaaactagt gttaagaaat gcttagggat tttttttttc ttatttttca 2040 

taactaaagc tttcacccag agccggctct gtttgcactt tgctgccgac attgcaaact 2100

ttttggcagg gtgggagact gagtctcatt ctgtcaccca ggctggagtg cagtggcccg 2160 

atctcagctt actgcaacct ctgccctcca gggttctggc aattccgcct cagtctcctg 2220 



agtagctggg attacaggca tgcgccacca cacccggtta atttttgtat ttttagtaga 2280

gaccaggttt catcatgttg ggcaggatgg tcttgaaccc ctgacctcag gtgatctgcc 2340

catctcggcc tctcaaagtg ctgggattac aagtgtgagc catcgcgccc ggcctgcaaa 2400

cttttttgta ggtatttctg gtaaacaaat ccttaggtta tctttgctgt ggttgtggtt 2460

tggctttagt catgatttca aagtagaaat agctaggcat tattttttga aatatatgac 2520

ctatatgtag tcaagaatcc actgaacaga gggaagcaaa ccttttggaa actggctttt 2580

gggcagacag taaacgtcca gtttgatgct ggaagcatga acagcttcat caggtaggta 2640

ctcctcaact ctgatgagtt tgtcctttca gcctaagggg gtggaaggga gttgtttgag 2700

aatagcaaat acgcatgttg attgcgagtg tgtggagaca aaggcagttc ccaccacagt 2760

taggtcctgg ccagtttgtt tcctcgcctg cgatgctcct tgtacgtcct caccctcctc 2820

tcccgcctct gccttctgct gggtcaaagg tggccttttt ctctccagcc ttgaattgtt 2880

ccctgttggc ttcccaaggg cccatctgct ggtacagtcc acacttccaa agccaagacc 2940

cgagagggct ttcactgccc caagcctctc tcctgtgacc ttgggattct gtcttggcag 3000

aatcctttgt cagcggctct tgctctgtcc ttcctgtttg gccacagctc tttcaatcaa 3060

tgggtattct agaaccgcag gatgtcagag ctggaaggga cgcgataccg gtttacacaa 3120

ggggaaactc ctcgaggctc tgggagggac ggagggtttt ggtgacagag cgagagctaa 3180

aattgaggat tcctgaatcc agatcttgcc tcccatcagc catctttctc ccaataaatt 3240

tttgttttgt gc 3252












<210> 49

<211> 3014

<212> DNA

<213> Homo sapien

<400> 49

ggtcatgcgg gtgccctggc ccccaggcca gcagctgggg cctttttgag gtgggggctc 60

cccggactca ggagttcctc cttggccgcg gctgtagctg agagaaggga caggaggtag 120

gatgccacct cctccagaag cgtggtgacc cgggctgtgt ggaaattcac tgggcctgat 180

ggagctggat gcagccttgg aaattctccc tttccctttg ggctcccgca gcaggccccc 240

cagtcctggg tgcaggcttg tgacagctgc tcagggatcc aaacgtggcg cttgagctct 300

ctgcgatgtg ttccacagtt ccctggaggc ggacactgcg tgagggtcag ccatgtcatc 360

tgagcctccc ccaccaccac agccccccac ccatcaagct tcagtcgggc tgctggacac 420

ccctcggagc cgtgagcgct caccatcccc tctgcgcggc aacgtggtcc caagcccact 480

gcccactcgc cggacgagga ccttctcggc gacggtgcgg gcttcacagg gccccgtcta 540

caaaggagtc tgcaaatgct tctgccggtc caagggccat ggcttcatta ccccagctga 600

tggcggcccc gacatcttcc tgcacatctc tgatgtggaa ggggagtatg tcccagtgga 660 

aggcgacgag gtcacctata aaatgtgctc catcccaccc aagaatgaga agctgcaggc 720 

cgtggaggtc gtcatcactc acctggcacc aggcaccaag catgagacct ggtctggaca 780 

tgtcatcagc tcctaggaga tggtggaagc accccttgtc ctgtgcttgt gggagacttt 840 

gcggggagga ggcagcagac actggagatg acattcttcc acacgagacg gggcttcagc 900 

cgggcatggt ccctctcaag tatctcctgg aggaaggggt atggggggca ggtgtggggt 960 

gtggggtgtt cccggccatc agcacagcct atgaccattg caacaacctc tcaccatctg 1020 

aagagcatta aaagcattta aaaaggagag gtgcccactg gtggctgagt ggaggttcca 1080 

accccatccc agggagtgga tcaagggtgg tatttctcca gctgctcaga cacatgggct 1140 

caacccacag aatccctctt cctcctggag ctggaggccc cagattccca gatctggccc 1200

cctggcagcc tgacagggac cttgcgtgac ttctccaagg caaatttcca cctaagtgcc 1260 

ccttgcgcct ctcctggggc ctgggcaaag cagttttcta attcttggct tggttggttc 1320 

taggggagct ggcttgaagt gggtggggaa aggcgggggt ggcggtcttt ggattggacg 1380

gatgttgcct tttggtgcct ttgcagtggg aggcggcata gctgcctgtc tggggaagac 1440

agttctccca gcactcccac ccctgggcac agcaggctgg tactgggagg ctgaacccct 1500 

cttagagcct gaccttttca tctgccttct ggttgtgtga ccatcactca acagccattt 1560 

cacagcccct gtaattatgg cggcgggggg ctggggtggt ggtggtggga agggcttgtg 1620

gagaggacac agtctttgtt taaaaacttt gtcccgatcc atccagaaaa gagtaggtag 1680 

cttgcatcct gacagcctgg caaagtcaag aaagttgaag gagaaacata cctttggaga 1740

gggggttttc tttaaaacta gtgttaagaa atgcttaggg attttttttt tcttattttt 1800 

cataactaaa gctttcaccc agagccggct ctgtttgcac tttgctgccg acattgcaaa 1860 

ctttttggca gggtgggaga ctgagtctca ttctgtcacc caggctggag tgcagtggcc 1920 

cgatctcagc ttactgcaac ctctgccctc cagggttctg gcaattccgc ctcagtctcc 1980 

tgagtagctg ggattacagg catgcgccac cacacccggt taatttttgt atttttagta 2040

gagaccaggt ttcatcatgt tgggcaggat ggtcttgaac ccctgacctc aggtgatctg 2100 

cccatctcgg cctctcaaag tgctgggatt acaagtgtga gccatcgcgc ccggcctgca 2160 

aacttttttg taggtatttc tggtaaacaa atccttaggt tatctttgct gtggttgtgg 2220 

tttggcttta gtcatgattt caaagtagaa atagctaggc attatttttt gaaatatatg 2280 

acctatatgt agtcaagaat ccactgaaca gagggaagca aaccttttgg aaactggctt 2340

ttgggcagac agtaaacgtc cagtttgatg ctggaagcat gaacagcttc atcaggtagg 2400

tactcctcaa ctctgatgag tttgtccttt cagcctaagg gggtggaagg gagttgtttg 2460

agaatagcaa atacgcatgt tgattgcgag tgtgtggaga caaaggcagt tcccaccaca 2520

gttaggtcct ggccagtttg tttcctcgcc tgcgatgctc cttgtacgtc ctcaccctcc 2580

tctcccgcct ctgccttctg ctgggtcaaa ggtggccttt ttctctccag ccttgaattg 2640

ttccctgttg gcttcccaag ggcccatctg ctggtacagt ccacacttcc aaagccaaga 2700

cccgagaggg ctttcactgc cccaagcctc tctcctgtga ccttgggatt ctgtcttggc 2760

agaatccttt gtcagcggct cttgctctgt ccttcctgtt tggccacagc tctttcaatc 2820

aatgggtatt ctagaaccgc aggatgtcag agctggaagg gacgcgatac cggtttacac 2880

aaggggaaac tcctcgaggc tctgggaggg acggagggtt ttggtgacag agcgagagct 2940

aaaattgagg attcctgaat ccagatcttg cctcccatca gccatctttc tcccaataaa 3000

tttttgtttt gtgc 3014


<210> 50

<211> 2716

<212> DNA

<213> Homo sapien

<400> 50

gctcaggacc ggtcggaatt ccgggggcgg acagaacgga ttgcagggtc agccatgtca 60

tctgagcctc ccccaccacc acagcccccc acccatcaag cttcagtcgg gctgctggac 120

acccctcgga gccgtgagcg ctcaccatcc cctctgcgcg gcaacgtggt cccaagccca 180

ctgcccactc gccggacgag gaccttctcg gcgacggtgc gggcttcaca gggccccgtc 240

tacaaaggag tctgcaaatg cttctgccgg tccaagggcc atggcttcat taccccagct 300

gatggcggcc ccgacatctt cctgcacatc tctgatgtgg aaggggagta tgtcccagtg 360

gaaggcgacg aggtcaccta taaaatgtgc tccatcccac ccaagaatga gaagctgcag 420

gccgtggagg tcgtcatcac tcacctggca ccaggcacca agcatgagac ctggtctgga 480

catgtcatca gctcctagga gatggtggaa gcaccccttg tcctgtgctt gtgggagact 540

ttgcggggag gaggcagcag acactggaga tgacattctt ccacacgaga cggggcttca 600

gccgggcatg gtccctctca agtatctcct ggaggaaggg gtatgggggg caggtgtggg 660

gtgtggggtg ttcccggcca tcagcacagc ctatgaccat tgcaacaacc tctcaccatc 720

tgaagagcat taaaagcatt taaaaaggag aggtgcccac tggtggctga gtggaggttc 780

caaccccatc ccagggagtg gatcaagggt ggtatttctc cagctgctca gacacatggg 840

ctcaacccac agaatccctc ttcctcctgg agctggaggc cccagattcc cagatctggc 900

cccctggcag cctgacaggg accttgcgtg acttctccaa ggcaaatttc cacctaagtg 960

ccccttgcgc ctctcctggg gcctgggcaa agcagttttc taattcttgg cttggttggt 1020

tctaggggag ctggcttgaa gtgggtgggg aaaggcgggg gtggcggtct ttggattgga 1080

cggatgttgc cttttggtgc ctttgcagtg ggaggcggca tagctgcctg tctggggaag 1140

acagttctcc cagcactccc acccctgggc acagcaggct ggtactggga ggctgaaccc 1200

ctcttagagc ctgacctttt catctgcctt ctggttgtgt gaccatcact caacagccat 1260

ttcacagccc ctgtaattat ggcggcgggg ggctggggtg gtggtggtgg gaagggcttg 1320

tggagaggac acagtctttg tttaaaaact ttgtcccgat ccatccagaa aagagtaggt 1380

agcttgcatc ctgacagcct ggcaaagtca agaaagttga aggagaaaca tacctttgga 1440

gagggggttt tctttaaaac tagtgttaag aaatgcttag ggattttttt tttcttattt 1500

ttcataacta aagctttcac ccagagccgg ctctgtttgc actttgctgc cgacattgca 1560

aactttttgg cagggtggga gactgagtct cattctgtca cccaggctgg agtgcagtgg 1620

cccgatctca gcttactgca acctctgccc tccagggttc tggcaattcc gcctcagtct 1680

cctgagtagc tgggattaca ggcatgcgcc accacacccg gttaattttt gtatttttag 1740

tagagaccag gtttcatcat gttgggcagg atggtcttga acccctgacc tcaggtgatc 1800

tgcccatctc ggcctctcaa agtgctggga ttacaagtgt gagccatcgc gcccggcctg 1860

caaacttttt tgtaggtatt tctggtaaac aaatccttag gttatctttg ctgtggttgt 1920

ggtttggctt tagtcatgat ttcaaagtag aaatagctag gcattatttt ttgaaatata 1980

tgacctatat gtagtcaaga atccactgaa cagagggaag caaacctttt ggaaactggc 2040

ttttgggcag acagtaaacg tccagtttga tgctggaagc atgaacagct tcatcaggta 2100

ggtactcctc aactctgatg agtttgtcct ttcagcctaa gggggtggaa gggagttgtt 2160

tgagaatagc aaatacgcat gttgattgcg agtgtgtgga gacaaaggca gttcccacca 2220

cagttaggtc ctggccagtt tgtttcctcg cctgcgatgc tccttgtacg tcctcaccct 2280

cctctcccgc ctctgccttc tgctgggtca aaggtggcct ttttctctcc agccttgaat 2340

tgttccctgt tggcttccca agggcccatc tgctggtaca gtccacactt ccaaagccaa 2400

gacccgagag ggctttcact gccccaagcc tctctcctgt gaccttggga ttctgtcttg 2460

gcagaatcct ttgtcagcgg ctcttgctct gtccttcctg tttggccaca gctctttcaa 2520

tcaatgggta ttctagaacc gcaggatgtc agagctggaa gggacgcgat accggtttac 2580

acaaggggaa actcctcgag gctctgggag ggacggaggg ttttggtgac agagcgagag 2640

ctaaaattga ggattcctga atccagatct tgcctcccat cagccatctt tctcccaata 2700

aatttttgtt ttgtgc 2716

<210> 51

<211> 3539 

<212> DNA 

<213> Homo sapien 

<400> 51 

caatgagaag acttactttc ctggtaggca gcctgctttg ttttgcacag gaagatagat 60 

tttttttctc atctttctta taaacaacct catgcacatt ctgtgtttga gccaagacta 120 

gtcgaattcg aattcgaatt cgaagcggct ggcggacaga acggattgca gggtcagcca 180 

tgtcatctga gcctccccca ccaccacagc cccccaccca tcaagcttca gtcgggctgc 240

tggacacccc tcggagccgt gagcgctcac catcccctct gcgcggcaac gtggtcccaa 300 

gcccactgcc cactcgccgg acgaggacct tctcggcgtg agtctggagc aggctgggga 360 

cgcgttctca gatggatctc ttgcctgagg ccagagattc actgtgtctc tggccatttt 420

tgggcctatg tcccggactg ccaaataggc agacctgggg ctgggccgtg ggctgagggt 480 

gagaaactag gaaggcgcct tgtaaattca gggtactgcc cagcccagaa gaacctgtta 540 

ccagctggga gtggcaatgg gtgggcagtt gctggggtca gaatcgctgg gctttaacca 600 

cacatgagag tctggttccc tgtgagactg tgaattctgc agctgcaaaa tgggcacagc 660 

accagggctg caaaggaccc acataatgcc tggcagatgc aggtctcagg ccttgcgagt 720 

ggaggatgct tgtgttatta atagtaaatc attagtttct ggagggatgt tcagtgatgt 780 

tttagttcaa ctccctcttt ccaggaggga aaacaggccc gagaaaattg gccagggtca 840 

cccaaagagt cctttccccc agcgtctttg tgagcggctc ccagggttga gggttgtgca 900 

gtcctcagcc tgtgtgactg tacatggcgg tcctggccag ggcttgatgg ggcacagctg 960 

tgggcagggg ctggaatgag ctttgtgtgc ccaggagcgc tgatggggcc cctgacattt 1020 

ccccctctct gtcaggacgg tgcgggcttc acagggcccc gtctacaaag gagtctgcaa 1080 

atgcttctgc cggtccaagg gccatggctt cattacccca gctgatggcg gccccgacat 1140 

cttcctgcac atctctgatg tggaagggga gtatgtccca gtggaaggcg acgaggtcac 1200

ctataaaatg tgctccatcc cacccaagaa tgagaagctg caggccgtgg aggtcgtcat 1260 

cactcacctg gcaccaggca ccaagcatga gacctggtct ggacatgtca tcagctccta 1320 

ggagatggtg gaagcacccc ttgtcctgtg cttgtgggag actttgcggg gaggaggcag 1380

cagacactgg agatgacatt cttccacacg agacggggct tcagccgggc atggtccctc 1440 

tcaagtatct cctggaggaa ggggtatggg gggcaggtgt ggggtgtggg gtgttcccgg 1500

ccatcagcac agcctatgac cattgcaaca acctctcacc atctgaagag cattaaaagc 1560 

atttaaaaag gagaggtgcc cactggtggc tgagtggagg ttccaacccc atcccaggga 1620 

gtggatcaag ggtggtattt ctccagctgc tcagacacat gggctcaacc cacagaatcc 1680

ctcttcctcc tggagctgga ggccccagat tcccagatct ggccccctgg cagcctgaca 1740

gggaccttgc gtgacttctc caaggcaaat ttccacctaa gtgccccttg cgcctctcct 1800

ggggcctggg caaagcagtt ttctaattct tggcttggtt ggttctaggg gagctggctt 1860

gaagtgggtg gggaaaggcg ggggtggcgg tctttggatt ggacggatgt tgccttttgg 1920

tgcctttgca gtgggaggcg gcatagctgc ctgtctgggg aagacagttc tcccagcact 1980

cccacccctg ggcacagcag gctggtactg ggaggctgaa cccctcttag agcctgacct 2040

tttcatctgc cttctggttg tgtgaccatc actcaacagc catttcacag cccctgtaat 2100

tatggcggcg gggggctggg gtggtggtgg tgggaagggc ttgtggagag gacacagtct 2160

ttgtttaaaa actttgtccc gatccatcca gaaaagagta ggtagcttgc atcctgacag 2220

cctggcaaag tcaagaaagt tgaaggagaa acataccttt ggagaggggg ttttctttaa 2280

aactagtgtt aagaaatgct tagggatttt ttttttctta tttttcataa ctaaagcttt 2340

cacccagagc cggctctgtt tgcactttgc tgccgacatt gcaaactttt tggcagggtg 2400

ggagactgag tctcattctg tcacccaggc tggagtgcag tggcccgatc tcagcttact 2460

gcaacctctg ccctccaggg ttctggcaat tccgcctcag tctcctgagt agctgggatt 2520

acaggcatgc gccaccacac ccggttaatt tttgtatttt tagtagagac caggtttcat 2580

catgttgggc aggatggtct tgaacccctg acctcaggtg atctgcccat ctcggcctct 2640

caaagtgctg ggattacaag tgtgagccat cgcgcccggc ctgcaaactt ttttgtaggt 2700

atttctggta aacaaatcct taggttatct ttgctgtggt tgtggtttgg ctttagtcat 2760

gatttcaaag tagaaatagc taggcattat tttttgaaat atatgaccta tatgtagtca 2820

agaatccact gaacagaggg aagcaaacct tttggaaact ggcttttggg cagacagtaa 2880

acgtccagtt tgatgctgga agcatgaaca gcttcatcag gtaggtactc ctcaactctg 2940

atgagtttgt cctttcagcc taagggggtg gaagggagtt gtttgagaat agcaaatacg 3000

catgttgatt gcgagtgtgt ggagacaaag gcagttccca ccacagttag gtcctggcca 3060

gtttgtttcc tcgcctgcga tgctccttgt acgtcctcac cctcctctcc cgcctctgcc 3120

ttctgctggg tcaaaggtgg cctttttctc tccagccttg aattgttccc tgttggcttc 3180

ccaagggccc atctgctggt acagtccaca cttccaaagc caagacccga gagggctttc 3240

actgccccaa gcctctctcc tgtgaccttg ggattctgtc ttggcagaat cctttgtcag 3300

cggctcttgc tctgtccttc ctgtttggcc acagctcttt caatcaatgg gtattctaga 3360

accgcaggat gtcagagctg gaagggacgc gataccggtt tacacaaggg gaaactcctc 3420

gaggctctgg gagggacgga gggttttggt gacagagcga gagctaaaat tgaggattcc 3480

tgaatccaga tcttgcctcc catcagccat ctttctccca ataaattttt gttttgtgc 3539


<210> 52 

<211> 2810 

<212> DNA 

<213> Homo sapien 












<400> 52

caaggaattt tagggcggac gtcgggtagt ctttcccgcc tgcccacctg ttctctgcgt 60

gcaggggaag ctcttcccgc cctccccgct gcttttgcct ttgggcggga gccctggacg 120

ggggactcgc cgggcggtgg ggcgaggcgc ggggggcgtc gcacgcggga agcagagcca 180

ccccccgccg ggcctggccg cgccagctgt ggctcgaggt tgggacgtca gccatgtcat 240

ctgagcctcc cccaccacca cagcccccca cccatcaagc ttcagtcggg ctgctggaca 300

cccctcggag ccgtgagcgc tcaccatccc cgtctacaaa ggagtctgca aatgcttctg 360

ccggtccaag ggccatggct tcattacccc agctgatggc ggccccgaca tcttcctgca 420

catctctgat gtggaagggg agtatgtccc agtggaaggc gacgaggtca cctataaaat 480

gtgctccatc ccacccaaga atgagaagct gcaggccgtg gaggtcgtca tcactcacct 540

ggcaccaggc accaagcatg agacctggtc tggacatgtc atcagctcct aggagatggt 600

ggaagcaccc cttgtcctgt gcttgtggga gactttgcgg ggaggaggca gcagacactg 660

gagatgacat tcttccacac gagacggggc ttcagccggg catggtccct ctcaagtatc 720

tcctggagga aggggtatgg ggggcaggtg tggggtgtgg ggtgttcccg gccatcagca 780

cagcctatga ccattgcaac aacctctcac catctgaaga gcattaaaag catttaaaaa 840

ggagaggtgc ccactggtgg ctgagtggag gttccaaccc catcccaggg agtggatcaa 900

gggtggtatt tctccagctg ctcagacaca tgggctcaac ccacagaatc cctcttcctc 960

ctggagctgg aggccccaga ttcccagatc tggccccctg gcagcctgac agggaccttg 1020

cgtgacttct ccaaggcaaa tttccaccta agtgcccctt gcgcctctcc tggggcctgg 1080

gcaaagcagt tttctaattc ttggcttggt tggttctagg ggagctggct tgaagtgggt 1140

ggggaaaggc gggggtggcg gtctttggat tggacggatg ttgccttttg gtgcctttgc 1200

agtgggaggc ggcatagctg cctgtctggg gaagacagtt ctcccagcac tcccacccct 1260

gggcacagca ggctggtact gggaggctga acccctctta gagcctgacc ttttcatctg 1320

ccttctggtt gtgtgaccat cactcaacag ccatttcaca gcccctgtaa ttatggcggc 1380

ggggggctgg ggtggtggtg gtgggaaggg cttgtggaga ggacacagtc tttgtttaaa 1440

aactttgtcc cgatccatcc agaaaagagt aggtagcttg catcctgaca gcctggcaaa 1500

gtcaagaaag ttgaaggaga aacatacctt tggagagggg gttttcttta aaactagtgt 1560

taagaaatgc ttagggattt tttttttctt atttttcata actaaagctt tcacccagag 1620 

ccggctctgt ttgcactttg ctgccgacat tgcaaacttt ttggcagggt gggagactga 1680 

gtctcattct gtcacccagg ctggagtgca gtggcccgat ctcagcttac tgcaacctct 1740 

gccctccagg gttctggcaa ttccgcctca gtctcctgag tagctgggat tacaggcatg 1800 

cgccaccaca cccggttaat ttttgtattt ttagtagaga ccaggtttca tcatgttggg 1860 

caggatggtc ttgaacccct gacctcaggt gatctgccca tctcggcctc tcaaagtgct 1920 

gggattacaa gtgtgagcca tcgcgcccgg cctgcaaact tttttgtagg tatttctggt 1980 

aaacaaatcc ttaggttatc tttgctgtgg ttgtggtttg gctttagtca tgatttcaaa 2040 

gtagaaatag ctaggcatta ttttttgaaa tatatgacct atatgtagtc aagaatccac 2100 

tgaacagagg gaagcaaacc ttttggaaac tggcttttgg gcagacagta aacgtccagt 2160 

ttgatgctgg aagcatgaac agcttcatca ggtaggtact cctcaactct gatgagtttg 2220 

tcctttcagc ctaagggggt ggaagggagt tgtttgagaa tagcaaatac gcatgttgat 2280 

tgcgagtgtg tggagacaaa ggcagttccc accacagtta ggtcctggcc agtttgtttc 2340 

ctcgcctgcg atgctccttg tacgtcctca ccctcctctc ccgcctctgc cttctgctgg 2400 

gtcaaaggtg gcctttttct ctccagcctt gaattgttcc ctgttggctt cccaagggcc 2460 

catctgctgg tacagtccac acttccaaag ccaagacccg agagggcttt cactgcccca 2520 

agcctctctc ctgtgacctt gggattctgt cttggcagaa tcctttgtca gcggctcttg 2580 

ctctgtcctt cctgtttggc cacagctctt tcaatcaatg ggtattctag aaccgcagga 2640 

tgtcagagct ggaagggacg cgataccggt ttacacaagg ggaaactcct cgaggctctg 2700 

ggagggacgg agggttttgg tgacagagcg agagctaaaa ttgaggattc ctgaatccag 2760 

atcttgcctc ccatcagcca tctttctccc aataaatttt tgttttgtgc 2810 

<210> 53 

<211> 4725 

<212> DNA 

<213> Homo sapien 

<400> 53 

aatttgaaaa ccccatggga cctgagtcaa accgatccag ggctgctgct gtccctcccg 60 

gctgtgtggt gttgagtggc gtcctgagtt aggcttctcc ctacttcatc tcataccccc 120 

atcaccagcc ctgcgcaggt gccaagtgat gcccacccag ctgccccact ttgcagattc 180 

ggaaactgag cccagagatg ggaaaggact tacccaaggg cctctgactg ctggactgtt 240 

tctctttccc ctgaacctcg agctgaggcc acgttagagc cgacaccttc atattcaatg 300 

ttcagtgatt attgtgtaca gcgagtgtcc tgctcttgaa aaattgtcaa aagcacagca 360

gcagctaggg aattcccgcc ctccagctgg atttaccggt acatactagg ggtcacagaa 420

ggaatttgat ttcataacag cgtggtgacc cgggctgtgt ggaaattcac tgggcctgat 480

ggagctggat gcagccttgg aaattctccc tttccctttg ggctcccgca gcaggccccc 540

cagtcctggg tgcaggcttg tgacagctgc tcagggatcc aaacgtggcg cttgagctct 600

ctgcgatgtg ttccacagtt ccctggaggc ggacactgcg tgagggtcag tttttgcccc 660

tgcttccctg tggtgtctgg ctcccagcag cgtcagggag ggtccgtgga gtggctgagt 720

ttggatctag gtggttggca ttaaaatctc cctggctttg ggttttcttt tttgagacag 780

agtcttgctc tgtagcccag gctggagtgc agtggtgcga tctcagctca ctggaacctc 840

cgcctcccag gttcaagcaa ttctcatgcc tcagccttca agtagatggg attacaggtg 900

cctgccacca tgcccagcta atttttgtat tcgtattaga gactgggttt ccccatgttg 960

gccaggctag tctcgaactc ctgacattaa gtgatccgcc tgcctcggcc tcccagagtg 1020

ctgggattgc aggtgtgagc cactgcgccc ggcccctggc tttgggtttt ctgctcacct 1080

tcctccttcc ctcccacaaa tactatgtgt cccaaatgtg ccgggatcca tgccttgtgc 1140

tggggacaca gaggggaccg atgccatcct ggtcatgggg gaggatgtgg cattttcatg 1200

aagagctatg atcgaggggg ccccacggtt gcttcaggag cccagagtag gcatctgacc 1260

ctgccaggcg tgctcccaga caaggtgaca atggtgctga ggtgcccgag aggaatgagc 1320

ctcagatact ggactgtcag acgtggacat gggggtaagc acagggtccc taccttgcaa 1380

ggcacgagcc cctgacagcc tcttctcttt ccaggtcagc catgtcatct gagcctcccc 1440

caccaccaca gccccccacc catcaagctt cagtcgggct gctggacacc cctcggagcc 1500

gtgagcgctc accatcccct ctgcgcggca acgtggtccc aagcccactg cccactcgcc 1560

ggacgaggac cttctcggcg tgagtctgga gcaggctggg gacgcgttct cagatggatc 1620

tcttgcctga ggccagagat tcactgtgtc tctggccatt tttgggccta tgtcccggac 1680

tgccaaatag gcagacctgg ggctgggccg tgggctgagg gtgagaaact aggaaggcgc 1740

cttgtaaatt cagggtactg cccagcccag aagaacctgt taccagctgg gagtggcaat 1800

gggtgggcag ttgctggggt cagaatcgct gggctttaac cacacatgag agtctggttc 1860

cctgtgagac tgtgaattct gcagctgcaa aatgggcaca gcaccagggc tgcaaaggac 1920

ccacataatg cctggcagat gcaggtctca ggccttgcga gtggaggatg cttgtgttat 1980

taatagtaaa tcattagttt ctggagggat gttcagtgat gttttagttc aactccctct 2040

ttccaggagg gaaaacaggc ccgagaaaat tggccagggt cacccaaaga gtcctttccc 2100

ccagcgtctt tgtgagcggc tcccagggtt gagggttgtg cagtcctcag cctgtgtgac 2160

tgtacatggc ggtcctggcc agggcttgat ggggcacagc tgtgggcagg ggctggaatg 2220

agctttgtgt gcccaggagc gctgatgggg cccctgacat ttccccctct ctgtcaggac 2280 

ggtgcgggct tcacagggcc ccgtctacaa aggagtctgc aaatgcttct gccggtccaa 2340

gggccatggc ttcattaccc cagctgatgg cggccccgac atcttcctgc acatctctga 2400 

gtgagtcagc ggcaggtctg ggtggcctgg ctggggtggg cgcttccttg gccctggcac 2460 

caggcaccaa gcatgagacc tggtctggac atgtcatcag ctcctaggag atggtggaag 2520 

caccccttgt cctgtgcttg tgggagactt tgcggggagg aggcagcaga cactggagat 2580 

gacattcttc cacacgagac ggggcttcag ccgggcatgg tccctctcaa gtatctcctg 2640 

gaggaagggg tatggggggc aggtgtgggg tgtggggtgt tcccggccat cagcacagcc 2700 

tatgaccatt gcaacaacct ctcaccatct gaagagcatt aaaagcattt aaaaaggaga 2760 

ggtgcccact ggtggctgag tggaggttcc aaccccatcc cagggagtgg atcaagggtg 2820 

gtatttctcc agctgctcag acacatgggc tcaacccaca gaatccctct tcctcctgga 2880 

gctggaggcc ccagattccc agatctggcc ccctggcagc ctgacaggga ccttgcgtga 2940

cttctccaag gcaaatttcc acctaagtgc cccttgcgcc tctcctgggg cctgggcaaa 3000

gcagttttct aattcttggc ttggttggtt ctaggggagc tggcttgaag tgggtgggga 3060 

aaggcggggg tggcggtctt tggattggac ggatgttgcc ttttggtgcc tttgcagtgg 3120 

gaggcggcat agctgcctgt ctggggaaga cagttctccc agcactccca cccctgggca 3180 

cagcaggctg gtactgggag gctgaacccc tcttagagcc tgaccttttc atctgccttc 3240 

tggttgtgtg accatcactc aacagccatt tcacagcccc tgtaattatg gcggcggggg 3300 

gctggggtgg tggtggtggg aagggcttgt ggagaggaca cagtctttgt ttaaaaactt 3360

tgtcccgatc catccagaaa agagtaggta gcttgcatcc tgacagcctg gcaaagtcaa 3420 

gaaagttgaa ggagaaacat acctttggag agggggtttt ctttaaaact agtgttaaga 3480
aatgcttagg gatttttttt ttcttatttt tcataactaa agctttcacc cagagccggc 3540 
tctgtttgca ctttgctgcc gacattgcaa actttttggc agggtgggag actgagtctc 3600
attctgtcac ccaggctgga gtgcagtggc ccgatctcag cttactgcaa cctctgccct 3660 
ccagggttct ggcaattccg cctcagtctc ctgagtagct gggattacag gcatgcgcca 3720 
ccacacccgg ttaatttttg tatttttagt agagaccagg tttcatcatg ttgggcagga 3780 
tggtcttgaa cccctgacct caggtgatct gcccatctcg gcctctcaaa gtgctgggat 3840
tacaagtgtg agccatcgcg cccggcctgc aaactttttt gtaggtattt ctggtaaaca 3900 
aatccttagg ttatctttgc tgtggttgtg gtttggcttt agtcatgatt tcaaagtaga 3960 
aatagctagg cattattttt tgaaatatat gacctatatg tagtcaagaa tccactgaac 4020

agagggaagc aaaccttttg gaaactggct tttgggcaga cagtaaacgt ccagtttgat 4080 

gctggaagca tgaacagctt catcaggtag gtactcctca actctgatga gtttgtcctt 4140 

tcagcctaag ggggtggaag ggagttgttt gagaatagca aatacgcatg ttgattgcga 4200 

gtgtgtggag acaaaggcag ttcccaccac agttaggtcc tggccagttt gtttcctcgc 4260 

ctgcgatgct ccttgtacgt cctcaccctc ctctcccgcc tctgccttct gctgggtcaa 4320 

aggtggcctt tttctctcca gccttgaatt gttccctgtt ggcttcccaa gggcccatct 4380

gctggtacag tccacacttc caaagccaag acccgagagg gctttcactg ccccaagcct 4440 

ctctcctgtg accttgggat tctgtcttgg cagaatcctt tgtcagcggc tcttgctctg 4500 

tccttcctgt ttggccacag ctctttcaat caatgggtat tctagaaccg caggatgtca 4560 

gagctggaag ggacgcgata ccggtttaca caaggggaaa ctcctcgagg ctctgggagg 4620 

gacggagggt tttggtgaca gagcgagagc taaaattgag gattcctgaa tccagatctt 4680 

gcctcccatc agccatcttt ctcccaataa atttttgttt tgtgc 4725 

<210> 54 

<211> 1147 

<212> DNA 

<213> Homo sapien 

<400> 54 

gaggatcccc atcgaggcsa tgtgaagaca gccagccgac attcttccaa kkggaagatg 60 

tgcatagaaa cagatacaaa caaatcacat atatggctca aggtcttgct gtgagagaga 120 

tgactggtat gactaagttt aaaccctatt tcatttctgc aacttctgag attttgtctc 180 

aaaaatgcat taatacaaat gtgctttttt tgtctctctc agataatcat ggacatattg 240 

atccctctct gagactcatc tgggatttgg ctttccttgg aagctcttac gtgatgtggg 300

aaatgacaac acaggtgtca cactactact tggcacagct cactagtgtg aggcagtgga 360 

aaactaatga tgatacaatt gattttgatt atactgttct acttcatgaa ttatcaacac 420

aggaaataat tccctgtcgc attcacttgg tctggtaccc tggcaaacct cttaaagtga 480 

agtaccactg tcaagagcta cagacaccag aagaagcctc cggaactgaa gaaggatcag 540 

ctgtagtacc aacagagctt agtaatttct aaaaagaaaa aatgatcttt ttccgacttc 600 

taaacaagtg actatactag cataaatcat tcttctagta aaacagctaa ggtatagaca 660 

ttctaataat ttgggaaaac ctatgattac aagtaaaaac tcagaaatgc aaagatgttg 720 

gttttttgtt tctcagtctg ctttagcttt taactctgga agcgcatgca cactgaactc 780 

tgctcagtgc taaacagtca ccagcaggtt cctcagggtt tcagccctaa aatgtaaaac 840 

ctggataatc agtgtatgtt gcaccagaat cagcattttt tttttaactg caaaaaatga 900

tggtctcatc tctgaattta tatttctcat tcttttgaac atactatagc taatatattt 960

tatgttgcta aattgcttct atctagcatg ttaaacaaag ataatatach

gtaaattata ggaaaaaaat taactgtttt aaaaagaact tgattatgtt ttatgatttc 1080

aggcaagtat tcatttttaa cttgctacct acttttaaat aaatgtttac atttctaaat 1140

aatgaaa 1147


<210> 55 

<211> 1415 

<212> DNA 

<213> Homo sapien 












<400> 55

caactgcctg cctactgaaa aatatccaat gcactggttt tagaaacagc gtgtatagct 60

tcatgtggta tatatgaaga atgtgggtgt tggagtcagt agatctgggt tggaattccc 120

atttgtccct tcctagctgt aatctcaaat tacctctcct gtaaaaccag gctcataata 180

cacagattgt aaaggagaat tagatgagaa catgtagaag gtgcctagca cagaacccag 240

cccatggtgg accttctaga attactgatt gcccctcccc ttgtcttatt gcctgtcata 300

tcagggctaa tatatcagct tgaggtatat atacagtatc tagatcctta aggttgttaa 360

aagaatctcc ctattcccac agacttcggt ctccatcaga aaaagtgaca gacattcaac 420

cagactttca tataactcag tgacaagaca aaaatgagga tccatgactt ccatatcttg 480

aagcaaaata ccaccaaaaa ccgtgaggct gaaatcgaaa aggcagtagg agacacaagg 540

cacccattca aaacaagaat gtattgtata gttattacac taaataccac catatttatt 600

acactaactc ttttttctcc ctctagaaaa actaatgatg atacaattga ttttgattat 660

actgttctac ttcatgaatt atcaacacag gaaataattc cctgtcgcat tcacttggtc 720

tggtaccctg gcaaacctct taaagtgaag taccactgtc aagagctaca gacaccagaa 780

gaagcctccg gaactgaaga aggatcagct gtagtaccaa cagagcttag taatttctaa 840

aaagaaaaaa tgatcttttt ccgacttcta aacaagtgac tatactagca taaatcattc 900

ttctagtaaa acagctaagg tatagacatt ctaataattt gggaaaacct atgattacaa 960

gtaaaaactc agaaatgcaa agatgttggt tttttgtttc tcagtctgct ttagctttta 1020

actctggaag cgcatgcaca ctgaactctg ctcagtgcta aacagtcacc agcaggttcc 1080

tcagggtttc agccctaaaa tgtaaaacct ggataatcag tgtatgttgc accagaatca 1140

gcattttttt tttaactgca aaaaatgatg gtctcatctc tgaatttata tttctcattc 1200

ttttgaacat actatagcta atatatttta tgttgctaaa ttgcttctat ctagcatgtt 1260

aaacaaagat aatatacttt cgatgaaagt aaattatagg aaaaaaatta actgttttaa 1320

aaagaacttg attatgtttt atgatttcag gcaagtattc atttttaact tgctacctac 1380 

ttttaaataa atgtttacat ttctaaataa tgaaa 1415 

<210> 56 

<211> 2201 

<212> DNA 

<213> Homo sapien 

<400> 56 

caggatcaga tacgcacttc tgcttcttct acaggagaga agagaatagt tttgatagag 60 

accagggaag aagccacctg tgccaaagca tggatgggga ccagacagaa gagagaatga 120

tgaagatgat ggtgcatcag agacctttgc cccagccagc actacttcct atgagctcca 180 

acactttccc ttcaagatct actaagccaa gtcccatgaa ccctctccca tcctctcaca 240

tgcctggagc attctcagaa agtaacagca gttttccaca gagtgcctcc ctgccaccat 300 

acttctctca aggccctagc aacagaccac ctatcagagc cgaaggcaga aacttcccct 360

tgccacttcc aaacaaacct cggcccccat cccccgcgga ggaagagaat tcattaaatg 420 

aagagtggta cgtttcttat attacccgac cagaggcaga agctgctctt agaaagataa 480 

accaggatgg cacatttctg gtcagagaca gctctaaaaa aacaacaacc aatccatatg 540

tcctcatggt gttgtacaaa gataaagttt acaacatcca gatccgttat cagaaggaaa 600 

gtcaagttta cttgttggga actggactcc gagggaaaga ggactttctg tctgtgtcag 660 

atattattga ctacttcagg aaaatgccac ttctgctcat tgatgggaaa aaccgaggtt 720

ccagatacca gtgcacatta acgcatgctg cagggtaccc atagcaagtt atagccgagc 780 

aaatgaaccg tcctcctgcc tctgttgcca acacgagatc aatcagcctt ggtcaatgga 840 

caaacactta ggactgaact gaacccctcc ccatgaacac aagggtttta tcctttcctt 900 

taaaaacagt gtttgaaatg aagactgtca actatcccat aatttattta ttcttcttca 960 

atgtttgtaa agtgcatgag tcatgttcac acttgaagtc tagtagtgca ctgtaataat 1020

tcatttttta aaagattatt taatgcccat ttcaaaatac agtagtttac acagctacag 1080 

aaacaatttg gggcaagttt taaaacactg aaacagtaat agttattggt gtcacataaa 1140

actgatttgt tttttacagc caaacctctg tcagtcagag gcattcatta gttttataca 1200 

tgtaatttga aaatcactaa acctcgtttt ctcagcagca ataatttaag aggcttcaaa 1260 

aatataattt cactcttatt tagtattttt tcctgggggc atttttacgt aattttttta 1320 

tgaaaagaca aatgcatgtt gagataactt ctgggattaa aatagtcttt tgctttactt 1380 

ttttggtttc ctaaaacaac tttattgact tttagtccat actgttatat ttttgtctta 1440 

aagaaaattt aaactacaaa taccaaaaga aaacatttta aatttaggga tgagactttg 1500

gtgtatcgtg ggtctaggtt taatgaacac atctggggtt aagttggcat ttcttcacat 1560

ctccacaccc acaccaacca tcacagcccc ccaccaacct tctcccaacc ccaaaagcat 1620

tgtccaggga tatagatttt accaaaggct tcctgggaag acgagggagc aacactttag 1680

attaaatgtg atcagacttt cctattagat atggctcttc tgtctcttgt tatccccctg 1740

acagctctgc cataaagtcc cttctcctca tccttcccaa acaggctgta taagtgcttt 1800

gaggtaatta aactctttcc tccagtttac aaatatcact taacaaaaaa tataggcatt 1860

cagccagatt aaaaaactgg tattcagcca aatagtgaca atcagttgtt cttcaagttt 1920

ttccctttgg gaccttggtt gttattgcac aacttttatt agcaacaatt tttggcgtct 1980

ctgcttaatc tacaagtttt cgaaatggaa aagagtatct tgcagcttca ttttcatgag 2040

ctaataaaag gggtattgga aggaatctaa gaagtcacca ttttaaaact gatgatatgt 2100

taaaataagg agtatgcaga aggtagagac ttttaactga tgataaaaat ggtgtttcac 2160

aaaatctcat ccttaacaac cagaagttct cagtttaggg



<210> 57 

<211> 2435 

<212> DNA 

<213> Homo sapien 

<400> 57 

ccccttccag tccctgctcc ttcaatagac agaagcacga aacctcccct agatcgttca 60

ttagctccgt ttgatagaga acccttcaca ctaggggagg ttgcgtgctt ctgtctattg 120

aaggagcacg aaacctcccc tagatcgttc attagctccg tttgatagag aacccttcac 180

actaggaaag aaaccaccat tttctgacaa gccctcgatt ccagcgggaa ggtcactcgg 240

ggagcattta cccaagattc aaaagcctcc tttaccaccg accacggaaa gacatgaaag 300

gagcagcccc ctgccaggga agaagccacc tgtgccaaag catggatggg gaccagacag 360

aagagagaat gatgaagatg atgtgcatca gagacctttg ccccagccag cactacttcc 420

tatgagctcc aacactttcc cttcaagatc tactaagcca agtcccatga accctctccc 480

atcctctcac atgcctggag cattctcaga aagtaacagc agttttccac agagtgcctc 540

cctgccacca tacttctctc aaggccctag caacagacca cctatcagag ccgaaggcag 600

aaacttcccc ttgccacttc caaacaaacc tcggccccca tcccccgcgg aggaagagaa 660

ttcattaaat gaagagtggt acgtttctta tattacccga ccagaggcag aagctgctct 720

tagaaagata aaccaggatg gaaacatttc tcacatttat gtcttctaac acacgtgggt 780

aaagaatcag taactgccaa tgaatgagat agcataaaat tttgactttc ataaaacaaa 840

acttgtaaat catttctgaa atttataagg tttccagctg acaattttct gccaagttaa 900

catttcaaaa atgaaaaaca aaatcaatgt gtattttaga gtatcatcta gatctgcgcc 960

cctttgtaaa tggattctat ggtggttaag tcatctagaa acaccctggg agcaaggttc 1020 

catcctacta cttcctaatt acatctgatc tgctttattt ttgtgtcctc tttatactag 1080 

ctgtttctaa ctggggggcg gttatggaat gtttcttagt gacagctggg gcctgccaat 1140 

ctcattcatt gttttctact tctgcacgtg catcagagac ctttgcccca gccagcacta 1200

cttcctatga gctccaacac tttcccttca agatctacta agccaagtcc catgaaccct 1260 

ctcccatcct ctcacatgcc tggagcattc tcagaaagta acagcagttt tccacagagt 1320 

gcctccctgc caccatactt ctctcaaggc cctagcaaca gaccacctat cagagccgaa 1380

ggcagaaact tccccttgcc acttccaaac aaacctcggc ccccatcccc cgcggaggaa 1440

gagaattcat taaatgaaga gtggtacgtt tcttatatta cccgaccaga ggcagaagct 1500 

gctcttagaa agataaacca ggatggcaca tttctggtca gagacagctc taaaaaaaca 1560 

acaaccaatc catatgtcct catggtgttg tacaaagata aagtttacaa catccagatc 1620 

cgttatcaga aggaaagtca agtttacttg ttgggaactg gactccgagg gaaagaggac 1680 

tttctgtctg tgtcagatat tattgactac ttcaggaaaa tgccacttct gctcattgat 1740 

gggaaaaacc gaggttccag ataccagtgc acattaacgc atgctgcagg gtacccatag 1800

caagttatag ccgagcaaat gaaccgtcct cctgcctctg ttgccaacac gagatcaatc 1860 

agccttggtc aatggacaaa cacttaggac tgaactgaac ccctccccat gaacacaagg 1920 

gttttatcct ttcctttaaa aacagtgttt gaaatgaaga ctgtcaacta tcccataatt 1980 

tatttattct tcttcaatgt ttgtaaagtg catgagtcat gttcacactt gaagtctagt 2040

agtgcactgt aataattcat tttttaaaag attatttaat gcccatttca aaatacagta 2100 

gtttacacag ctacagaaac aatttggggc aagttttaaa acactgaaac agtaatagtt 2160 

attggtgtca cataaaactg atttgttttt tacagccaaa cctctgtcag tcagaggcat 2220

tcattagttt tatacatgta atttgaaaat cactaaacct cgttttctca gcagcaataa 2280 
tttaagaggc ttcaaaaata taatttcact cttatttagt attttttcct gggggcattt 2340 
ttacgtaatt tttttatgaa aagacaaatg catgttgaga taacttctgg gattaaaata 2400
gtcttttgct ttaaaaaacc tcgccgcgac cacgc 2435

<210> 58 

<211> 3612 

<212> DNA 

<213> Homo sapien 



<400> 58 
cgaactccgc cccctcttct ccgcgtaggc ccagctccct gaagcggctg tttcgagcca 60
cgcgcccatc gggtaccgag gcacgcgccg ggcgtcacgt gcgtttcgcg gcgagcggaa 120
atgacgcgag ttgtgtgagc cgccagtatg gccgggctat ggcggcgagc actggctacg 180
tgcgactgtg gggagcggcg cggtgctggg tgctgcggcg gccgatgctg gccgccgccg 240

gggggcgggt tcccactgca gcaggagcgt ggttgctccg aggccagcgg acctgcgacg 300 

cctctcctcc ttgggcactg tggggccgag gcccggcaat tgggggccaa tggcgggggt 360 

tttgggaagc gagcagccgc ggcggaggcg cattctcggg gggcgaggac gcctccgagg 420 

gcggcgcgga ggaaggagcc ggcggcgcgg ggggcagcgc gggcgccggg gaaggcccgg 480 

tcataacggc gctcacgccc atgacgatcc ccgatgtgtt tccgcacctg ccgctcatcg 540

ccatcacccg caacccggtg ttcccgcgct ttatcaagat tatcgaggtt aaaaataaga 600 

agttggttga gctgctgaga aggaaagttc gtctcgccca gccttatgtc ggcgtctttc 660 

taaagagaga tgacagcaat gagtcggatg tggtcgagag cctggatgaa atctaccaca 720 

cggggacgtt tgcccagatc catgagatgc aggaccttgg ggacaagctg cgcatgatcg 780 
tcatgggaca cagaagagtc catatcagca gacagctgga ggtggagccc gaggagccgg 840 
aggcggagaa caagcacaag ccccgcagga agtcaaagcg gggcaagaag gaggcggagg 900 
acgagctgag cgccaggcac ccggcggagc tggcgatgga gcccacccct gagctcccgg 960 

ctgaggtgct catggtggag gtagagaacg ttgtccacga ggacttccag gtcacggagg 1020 

aggtgaaagc cctgactgca gagatcgtga agaccatccg ggacatcatt gccttgaacc 1080 

ctctctacag ggagtcagtg ctgcagatga tgcaggctgg ccagcgggtg gtggacaacc 1140 

ccatctacct gagcgacatg ggcgccgcgc tcaccggggc cgagtcccat gagctgcagg 1200 

acgtcctgga agagaccaat attcctaagc ggctgtacaa ggccctctcc ctgctgaaga 1260 

aggaatttga actgagcaag ctgcagcagc gcctggggcg ggaggtggag gagaagatca 1320

agcagaccca ccgtaagtac ctgctgcagg agcagctaaa gatcatcaag aaggagctgg 1380 

gcctggagaa ggacgacaag gatgccatcg aggagaagtt ccgggagcgc ctgaaggagc 1440

tcgtggtccc caagcacgtc atggatgttg tggacgagga gctgagcaag ctgggcctgc 1500 

tggacaacca ctcctcggag ttcaatgtca cccgcaacta cctagactgg ctcacgtcca 1560 

tcccttgggg caagtacagc aacgagaacc tggacctggc gcgggcacag gcagtgctgg 1620

aggaagacca ctacggcatg gaggacgtca agaaacgcat cctggagttc attgccgtta 1680

gccagctccg cggctccacc cagggcaaga tcctctgctt ctatggcccc cctggcgtgg 1740 

gtaagaccag cattgctcgc tccatcgccc gcgccctgaa ccgagagtac ttccgcttca 1800 
gcgtcggggg catgactgac gtggctgaga tcaagggcca caggcggacc tacgtgggcg 1860 
ccatgcccgg gaagatcatc cagtgtttga agaagaccaa gacggagaac cccctgatcc 1920
tcatcgacga ggtgcgtgcg gagaggcccg gagaccccct aacccggcaa tgccgtgtgc 1980
aagatgaagc cgagccttcc gggtcctagg atccccgagc caactcaggg gccatcgccg 2040
tgatgctggc cctgatggca cggacgggtc cctcccaccc acagccacgc ccagcaggct 2100
gcccgtccaa ggcagggtgt gggtgccgct ctcactcggg gtctccccag ccctgaccct 2160
ggggtctccc cactgggcgg gctggccagg ggcacggcga ggggcaccac ctgctgtctg 2220
ctgccccagg tggacaagat cggccgaggc taccaggggg acccgtcgtc ggcactgctg 2280
gagctgctgg acccagagca gaatgccaac ttcctggacc actacctgga cgtgcccgtg 2340
gacttgtcca aggtgggggg cctgtctggg gcctgggctt gctggggagg gtaagcggcc 2400
agtcccccag caccttgccg ccgcccccag gtgctgttca tctgcacggc caacgtcacg 2460
gacaccatcc ccgagccgct gcgagaccgt atggagatga tcaacgtgtc gggctacgtg 2520
gcccaggaga agctggccat tgcggagcgc tacctggtgc cccaggctcg cgccctgtgt 2580
ggcttggatg agagcaaggc caagctgtca tcggacgtgc tgacgctgct catcaagcag 2640
tactgccgcg agagcggtgt ccgcaacctg cagaagcaag tggagaaggt gttacggaaa 2700
tcggcctaca agattgtcag cggcgaggcc gagtccgtgg aggtgacgcc cgagaacctg 2760
caggacttcg tggggaagcc cgtgttcacc gtggagcgca tgtatgacgt gacaccgccc 2820
ggcgtggtca tggggctggc ctggaccgca atgggaggct ccacgctgtt tgtggagaca 2880
tccctgagac ggccacagga caaggatgcc aagggtgaca aggatggcag cctggaggtg 2940
acaggccagc tgggggaggt gatgaaggag agcgcccgca tagcctacac cttcgccaga 3000
gccttcctca tgcagcacgc ccccgccaat gactacctgg tgacctcaca catccacctg 3060
catgtgcccg agggcgccac ccccaaggac ggcccaagcg caggctgcac catcgtcacg 3120
gccctgctgt ccctggccat gggcaggcct gtccggcaga atctggccat gactggcgaa 3180
gtctccctca cgggcaagat cctgcctgtt ggtggcatca aggagaagac cattgcggcc 3240
aagcgcgcag gggtgacgtg catcgtcctg ccagccgaga acaagaagga cttctacgac 3300
ctggcagcct tcatcaccga gggcctggag gtgcacttcg tggaacacta ccgggagatc 3360
ttcgacatcg ccttcccgga cgagcaggca gaggcgctgg ccgtggaacg gtgacggcca 3420
ccccgggact gcaggcggcg gatgtcaggc cctgtctggg ccagaactga gcgctgtggg 3480
gagcgcgccc ggacctggca gtggagccac cgagcgagca gctcggtcca gtgacccaga 3540
tcccagggac ctcagtcggc ttaatcagag tgtggcatag aagctattta atgattaaag 3600
tcatttgcag ta 3612
<210> 59

<211> 2744 

<212> DNA 

<213> Homo sapien 

<400> 59 

tcctgggttc acgccattct cctgcctcag cctcccgagt agctgggact acaggcgccc 60 

gccatcacgc ccgcctaatt tttgtatttt ttggtagaga cggggtttca ctgtgttagc 120 

caggagggtc tcgatctcct gacctcatga tccacccgcc tcggcctccc aaagtgcttg 180 

gattacaggc gtgagccacc acgcccggcc aagacttggt ctcttaaaaa atggaggaaa 240 

gaaaacacag tctctttttt tcccctgaga tagcttatcc taaggacaag gctctgaggg 300 

gtaaggagac agggtttggt tttagttttg actctcttgg tcctgtgtcc ctgggactgg 360 

tgagtccacc agggactgca ctcctccatc tcccaggccg tccgtggcca gggccagttg 420 

gcggagggga gggagttgat ggagctggtg ggtggggctc ccagggccat gctcccaccc 480

agcccacccc ctgctggagc actgctgggc cccagcgacc ctgccccata cttggcgccc 540 

tgactgcaga gatcgtgaag accatccggg acatcattgc cttgaaccct ctctacaggg 600 

agtcagtgct gcagatgatg caggctggcc agcgggtggt ggacaacccc atctacctga 660

gcgacatggg cgccgcgctc accggggccg agtcccatga gctgcaggac gtcctggaag 720 

agaccaatat tcctaagcgg ctgtacaagg ccctctccct gctgaagaag gaatttgaac 780

tgagcaagct gcagcagcgc ctggggcggg aggtggagga gaagatcaag cagacccacc 840 

gtaagtacct gctgcaggag cagctaaaga tcatcaagaa ggagctgggc ctggagaagg 900 

acgacaagga tgccatcgag gagaagttcc gggagcgcct gaaggagctc gtggtcccca 960 

agcacgtcat ggatgttgtg gacgaggagc tgagcaagct gggcctgctg gacaaccact 1020 

cctcggagtt caatgtcacc cgcaactacc tagactggct cacgtccatc ccttggggca 1080 

agtacagcaa cgagaacctg gacctggcgc gggcacaggc agtgctggag gaagaccact 1140 

acggcatgga ggacgtcaag aaacgcatcc tggagttcat tgccgttagc cagctccgcg 1200 

gctccaccca gggcaagatc ctctgcttct atggcccccc tggcgtgggt aagaccagca 1260 

ttgctcgctc catcgcccgc gccctgaacc gagagtactt ccgcttcagc gtcgggggca 1320 

tgactgacgt ggctgagatc aagggccaca ggcggaccta cgtgggcgcc atgcccggga 1380 

agatcatcca gtgtttgaag aagaccaaga cggagaaccc cctgatcctc atcgacgagg 1440 

tggacaagat cggccgaggc taccaggggg acccgtcgtc ggcactgctg gagctgctgg 1500

acccagagca gaatgccaac ttcctggacc actacctgga cgtgcccgtg gacttgtcca 1560 

aggtgctgtt catctgcacg gccaacgtca cggacaccat ccccgagccg ctgcgagacc 1620
gtatggagat gatcaacgtg tcgggctacg tggcccagga gaagctggcc attgcggagc 1680

gctacctggt gccccaggct cgcgccctgt gtggcttgga tgagagcaag gccaagctgt 1740 

catcggacgt gctgacgctg ctcatcaagc agtactgccg cgagagcggt gtccgcaacc 1800 

tgcagaagca agtggagaag gtgttacgga aatcggccta caagattgtc agcggcgagg 1860 

ccgagtccgt ggaggtgacg cccgagaacc tgcaggactt cgtggggaag cccgtgttca 1920 

ccgtggagcg catgtatgac gtgacaccgc ccggcgtggt catggggctg gcctggaccg 1980 

caatgggagg ctccacgctg tttgtggaga catccctgag acggccacag gacaaggatg 2040 

ccaagggtga caaggatggc agcctggagg tgacaggcca gctgggggag gtgatgaagg 2100 

agagcgcccg catagcctac accttcgcca gagccttcct catgcagcac gcccccgcca 2160 

atgactacct ggtgacctca cacatccacc tgcatgtgcc cgagggcgcc acccccaagg 2220

acggcccaag cgcaggctgc accatcgtca cggccctgct gtccctggcc atgggcaggc 2280

ctgtccggca gaatctggcc atgactggcg aagtctccct cacgggcaag atcctgcctg 2340

ttggtggcat caaggagaag accattgcgg ccaagcgcgc aggggtgacg tgcatcgtcc 2400 

tgccagccga gaacaagaag gacttctacg acctggcagc cttcatcacc gagggcctgg 2460 

aggtgcactt cgtggaacac taccgggaga tcttcgacat cgccttcccg gacgagcagg 2520 

cagaggcgct ggccgtggaa cggtgacggc caccccggga ctgcaggcgg cggatgtcag 2580 

gccctgtctg ggccagaact gagcgctgtg gggagcgcgc ccggacctgg cagtggagcc 2640 

accgagcgag cagctcggtc cagtgaccca gatcccaggg acctcagtcg gcttaatcag 2700 

agtgtggcat agaagctatt taatgattaa agtcatttgc agta 2744 

<210> 60 

<211> 2742 

<212> DNA 

<213> Homo sapien 

<400> 60 

cgaactccgc cccctcttct ccgcgtaggc ccagctccct gaagcggctg tttcgagcca 60 

cgcgcccatc gggtaccgag gcacgcgccg ggcgtcacgt gcgtttcgcg gcgagcggaa 120 

atgacgcgag ttgtgtgagc cgccagtatg gccgggctat ggcggcgagc actggctacg 180 

tgcgactgtg gggagcggcg cggtgctggg tgctgcggcg gccgatgctg gccgccgccg 240 

gggggcgggt tcccactgca gcaggagcgt ggttgctccg aggccagcgg acctgcgacg 300 

cctctcctcc ttgggcactg tggggccgag gcccggcaat tgggggccaa tggcgggggt 360 

tttgggaagc gagcagccgc ggcggaggcg cattctcggg gggcgaggac gcctccgagg 420 

gcggcgcgga ggaaggagcc ggcggcgcgg ggggcagcgc gggcgccggg gaaggcccgg 480 

tcataacggc gctcacgccc atgacgatcc ccgatgtgtt tccgcacctg ccgctcatcg 540 

ccatcacccg caacccggtg ttcccgcgct ttatcaagat tatcgaggtt aaaaataaga 600 

agttggttga gctgctgaga aggaaagttc gtctcgccca gccttatgtc ggcgtctttc 660 

taaagagaga tgacagcaat gagtcggatg tggtcgagag cctggatgaa atctaccaca 720 

cggggacgtt tgcccagatc catgagatgc aggaccttgg ggacaagctg cgcatgatcg 780 

tcatgggaca cagaagagtc catatcagca gacagctgga ggtggagccc gaggagccgg 840 

aggcggagaa caagcacaag ccccgcagga agtcaaagcg gggcaagaag gaggcggagg 900 

acgagctgag cgccaggcac ccggcggagc tggcgatgga gcccacccct gagctcccgg 960 

ctgaggtgct catggtggag gtagagaacg ttgtccacga ggacttccag gtcacggagg 1020 

aggtgaaagc cctgactgca gagatcgtga agaccatccg ggacatcatt gccttgaacc 1080 

ctctctacag ggagtcagtg ctgcagatga tgcaggctgg ccagcgggtg gtggacaacc 1140 

ccatctacct gagcgacatg ggcgccgcgc tcaccggggc cgagtcccat gagctgcagg 1200 

acgtcctgga agagaccaat attcctaagc ggctgtacaa ggccctctcc ctgctgaaga 1260 

aggaatttga actgagcaag ctgcagcagc gcctggggcg ggaggtggag gagaagatca 1320 

agcagaccca ccgtaagtac ctgctgcagg agcagctaaa gatcatcaag aaggagctgg 1380 

gcctggagaa ggacgacaag gatgccatcg aggagaagtt ccgggagcgc ctgaaggagc 1440 

tcgtggtccc caagcacgtc atggatgttg tggacgagga gctgagcaag ctgggcctgc 1500 

tggacaacca ctcctcggag ttcaatgtca cccgcaacta cctagactgg ctcacgtcca 1560 

tcccttgggg caagtacagc aacgagaacc tggacctggc gcgggcacag gcagtgctgg 1620 

aggaagacca ctacggcatg gaggacgtca agaaacgcat cctggagttc attgccgtta 1680 

gccagctccg cggctccacc cagggcaaga tcctctgctt ctatggcccc cctggcgtgg 1740 

gtaagaccag cattgctcgc tccatcgccc gcgccctgaa ccgagagtac ttccgcttca 1800 

gcgtcggggg catgactgac gtggctgaga tcaagggcca caggcggacc tacgtgggcg 1860 

ccatgcccgg gaagatcatc cagtgtttga agaagaccaa gacggagaac cccctgatcc 1920 

tcatcgacga ggtggacaag atcggccgag gctaccaggg ggacccgtcg tcggcactgc 1980 

tggagctgct ggacccagag cagaatgcca acttcctgga ccactacctg gacgtgcccg 2040 

tggacttgtc caaggtgctg ttcatctgca cggccaacgt cacggacacc atccccgagc 2100 

cgctgcgaga ccgtatggag atgatcaacg tgtcgggcta cgtggcccag gagaagctgg 2160 

ccattgcgga gcgctacctg gtgccccagg ctcgcgccct gtgtggcttg gatgagagca 2220 

aggccaagct gtcatcggac gtgctgacgc tgctcatcaa gcagtactgc cgcgagagcg 2280 

gtgtccgcaa cctgcagaag caagtggaga aggtgttacg gaaatcggcc tacaagattg 2340 

tcagcggcga ggccgagtcc gtggaggtga cgcccgagaa cctgcaggac ttcgtgggga 2400 

agcccgtgtt caccgtggag cgcatgtatg acgtgacacc gcccggcgtg gtcatggggc 2460 

tggcctggac cgcaatgggt gagcgtggtg gcgggagaag accccaaagc cattcccatt 2520 

tctacccgag aacaagcagg tcccaccttg tacacctgtg cagtgggcct caggtggctc 2580 

tgaacggcct ctggcggggg gtggtgtgct gggagcccag ggggtctggg ctgagggagg 2640 

ggaaggctgt gacacatggg ggtggcagca caccatgggc cttatggatt tgggtgccac 2700 

tttgataggc ttacaggtaa ggacaagctc atccagatgg gc 2742 



<210> 61 

<211> 2116 

<212> DNA 

<213> Homo sapien 

<400> 61 

tgttgtactg gctggagtac agtggtgcag tctcagctca ctgcaacctc cgcctctcgg 60 

gttcaagcga ttctcctgcc tcagcctccc gagaagctgg aattacaggc atgcgccacc 120 

acgcccagct aattttgtat ttttagtaaa gatggtgttt ctccatgttg gtcaggctgg 180 

tctcgaactc ccgacctcag gtgatccgcc cacctcggcc tcccaaagtg ctggcatgac 240 

agagctagag ctcgggccca gccccaggct gcagcccatt cgcaggcacc cgaaagaact 300 

tccccagtat ggtggtcctg gaaaggacat ttttgaagat caactatatc ttcctgtgca 360 

ttccgatgga atttcagttc atcagatgtt caccatggcc accgcagaac accgaagtaa 420 

ttccagcata gcggggaaga tgttgaccaa ggtggagaag aatcacgaaa aggagaagtc 480 

acagcaccta gaaggcagcg cctcctcttc actctcctct gattagatga aactgttacc 540 

ttaccctaaa cacagtattt ctttttaact tttttatttg taaactaata aaggtaatca 600 

cagccaccaa cattccaagc taccctgggt acctttgtgc agtagaagct agtgagcatg 660 

tgagcaagcg gtgtgcacac ggagactcat cgttataatt tactatctgc caagagtaga 720 

aagaaaggct ggggatattt gggttggctt ggttttgatt ttttgcttgt ttgtttgttt 780 

tgtactaaaa cagtattatc ttttgaatat cgtagggaca taagtatata catgttatcc 840 

aatcaagatg gctagaatgg tgcctttctg agtgtctaaa acttgacacc cctggtaaat 900 

ctttcaacac acttccactg cctgcgtaat gaagttttga ttcattttta accactggaa 960 

tttttcaatg ccgtcatttt cagttagatg attttgcact ttgagattaa aatgccatgt 1020 

ctatttgatt agtcttattt ttttattttt acaggcttat cagtctcact gttggctgtc 1080 

attgtgacaa agtcaaataa acccccaagg acgacacaca gtatggatca catattgttt 1140 

gacattaagc ttttgccaga aaatgttgca tgtgttttac ctcgacttgc taaaatcgat 1200 

tagcagaaag gcatggctaa taatgttggt ggtgaaaata aataaataag taaacaaaat 1260 

gaagattgcc tgctctctct gtgcctagcc tcaaagcgtt catcatacat cataccttta 1320 

agattgctat attttgggtt attttcttga caggagaaaa agatctaaag atcttttatt 1380 

ttcatctttt ttggttttct tggcatgact aagaagctta aatgttgata aaatatgact 1440 

agttttgaat ttacaccaag aacttctcaa taaaagaaaa tcatgaatgc tccacaattt 1500 

caacatacca caagagaagt taatttctta acattgtgtt ctatgattat ttgtaagacc 1560 

ttcaccaagt tctgatatct tttaaagaca tagttcaaaa ttgcttttga aaatctgtat 1620 

tcttgaaaat atccttgttg tgtattaggt ttttaaatac cagctaaagg attacctcac 1680 

tgagtcatca gtaccctcct attcagctcc ccaagatgat gtgtttttgc ttaccctaag 1740 

agaggttttc ttcttatttt tagataattc aagtgcttag ataaattatg ttttctttaa 1800 

gtgtttatgg taaactcttt taaagaaaat ttaatatgtt atagctgaat ctttttggta 1860 

actttaaatc tttatcatag actctgtaca tatgttcaaa ttagctgctt gcctgatgtg 1920 

tgtatcatcg gtgggatgac agaacaaaca tatttatgat catgaataat gtgctttgta 1980 

aaaagatttc aagttattag gaagcatact ctgtttttta atcatgtata atattccatg 2040 

atacttttat agaacaattc tggcttcagg aaagtctaga agcaatattt cttcaaataa 2100 

aaggtgttta aacttt 2116 

<210> 62 

<211> 1855 

<212> DNA 

<213> Homo sapien 

<400> 62 

gggggcgggg aagtggcggc tgaagcgccg ccggcggggc tcactgtggt ggtgtgagtg 60 

ggaggcggcg gcgctggtgg ctgcagctgg ggtgaggcgc gaggcggcgc actcgacggc 120 

tgactggagc agcggtaaag gcgaggatgg agaccgaagg atataagtca aagagtacag 180 

cagaaaatgt ctactgaacg gacttcttgg acaagcctgt ccaccattca gaaaatagcc 240 

ctgggccttg ggatcccagc cagtgcaaca gttgcctata tcctataccg caggtatagg 300 

gaaagcagag aagagcggct gacatttgtt ggggaagatg acattgagat agagatgcgg 360 

gttccccagg aggctgtgaa actcatcatt ggccggcaag gagccaatat taaacagctg 420 

cggaaacaga caggtgctcg gattgatgtg gacacagagg atgtaggcga tgagcgagtg 480 

ctgcttatca gtggttttcc tgttcaggtg tgcaaggcca aagcagcaat ccatcagatc 540 

ctgacagaga ataccccagt gtctgagcag ctttcagttc cccagagatc tgtgggcaga 600 

atcataggga gaggcggcga gacaattcgt tctatctgta aggcatctgg agccaaaatt 660 

acctgtgaca aagaatcaga agggacatta ctactatcaa gacttataaa aatctcagga 720 

acacagaagg aagtggcagc agccaagcat ttgatactgg agaaagtttc agaagatgaa 780 

gaacttcgga agagaattgc tcattctgca gaaaccaggg tcccacgcaa acagccaatc 840 

agtgtgagaa gagaagacat gacagagcca ggtggagctg gagagccagc attatggaaa 900 

aacaccagtt ctagcatgga gccgactgca cccctggtga ctcctccacc caaaggagga 960 

ggcgacatgg ctgtggtagt gtcaaaggaa ggttcctggg agaaacctag tgatgacagc 1020 

tttcagaagt ctgaagccca ggccatccca gagatgccca tgtttgaaag tatgtaacaa 1080 

agagggagcc cattaatcat tgaagataga aatactggct tagatattgg agagattaga 1140 

aggaatgcct tctcagtctg agcagccagt ataatatggc tggggtcagg acagaagatt 1200 

cggtaagata ttttgaagat gggatggagg tagaatccac accaacagtt gctggacagt 1260 

ttgaggatgc agatgttgat cactgaaaat gatttatgca ggtataagat tctgctccta 1320 

attgtaggag agaacttggt gcctcttcca ctctgaagca aagttgatga aagtcttctt 1380 

ccttttccca aaacccaacc tgaactactt ctttcttgga gacagactat attgagacaa 1440 

gttgttacca gcaaaagata gatacatgac ctttattaac aaaaatgaat taaccaagag 1500 

gatatttgta gtttattatt taccccaaac tttctgtgtc tgggtaccct ctgagtaggc 1560 

ctataattcc tgccttcact gtatgcattt tatgtaagct agcagaccta tgtggtgaga 1620 

atgcacagga gcttggaggt ataaatagac agggtgggaa agagagagct cctttcgcca 1680 

tgttttacca gcccgctctg ttataacctc ttaggttata ttctttaatt tccaaccttt 1740 

taggttagtt tctgtaacag aacaaatgag tctgggataa agtcctcaaa gtacttcaaa 1800 

tggtaattgt tttgtttttg taacggctta acaaataacc taggttttct gttta 1855 



<210> 63 

<211> 1136 

<212> DNA 

<213> Homo sapien 

<400> 63 

sgctaggtac aggccggkgt kaatcggccg aggcgcacgt gctggaaggc ccagacacca 60 

acaccacgat cattcagctg cagcccttgc aggagcccga gagctgggcg cgcacgcaga 120 

gtggcctgca gtcctacctg ctccagttcc acggcctcgt gcgcctggtg caccaggagc 180 

ggaccttggc ctttcctctg accatccgct gcttcctggg ctgtgagctg cctcccgagg 240 

gctctagagc ccatgtcttc ttcgaagtgg ctgtgaatgg gagctccttt gtgagtttcc 300 

ggccggagag agccttgtgg caggcagaca cccaggtcac ctccggagtg gtcaccttca 360 

ccctgcagca gctcaatgcc tacaaccgca ctcggtatga actgcgggaa ttcctggagg 420 

acacctgtgt gcagtatgtg cagaaacata tttccgcgga aaacacgaaa gggagccaaa 480 

caagccgctc ctacacttcg ctggtcctgg gcgtcctggt gggcagtttc atcattgctg 540 

gtgtggctgt aggcatcttc ctgtgcacag gtggacggcg atgttaatta ctctccagcc 600 

ccgtcagaag gggctggatt gatggaggct ggcaagggaa agtttcagct cactgtgaag 660 

ccagactccc caactgaaac accagaaggt ttggagtgac agctcctttc ttctcccaca 720 

tctgcccact gaagatttga gggaggggag atggagagga gaggtggaca aagtacttgg 780 

tttgctaaga acctaagaac gtgtatgctt tgctgaatta gtctgataag tgaatgttta 840 

tctatctttg tggaaaacag ataatggagt tggggcagga agcctatggc ccatcctcca 900 

aagacagaca gaatcacctg aggcgttcaa aagatataac caaataaaca agtcatccac 960 

aatcaaaata caacattcaa tacttccagg tgtgtcagac ttgggatggg acgctgatat 1020 

aatagggtag aaagaagtaa cacgaagaag tggtggaaat gtaaaatcca agtcatatgg 1080 

cagtgatcaa ttattaatca attaataata ttaataaatt tcttatattt aaggca 1136 

<210> 64 

<211> 2320 

<212> DNA 

<213> Homo sapien 

<400> 64 

cgggatagct gtcttgagcc ccaagcctct tcctcccctg ctgcccctct gcagccattc 60 

gggatgggac cccctctggg gtgtcagcac gaaagggcta acgggagccc cttccttggc 120 

ctccccctgt aggttacaga gccatcacgg tgcggaagct catgcagggc atgggccttg 180 

gcctctccag cgtctttgct ctgtgcctgg gccacacctc cagcttctgt gagtctgtgg 240 

tctttgcatc agcctccatc ggcctccaga ccttcaacca cagtggcatt tctgttaaca 300 

tccaggactt ggccccgtcc tgcgccggct ttctgtttgg tgtggccaac acagccgggg 360 

ccttggcagg tgtcgtgggt gtgtgtctag gcggctactt gatggagacc acgggctcct 420 

ggacttgcct gttcaacctt gtggccatca tcagcaacct ggggctgtgc accttcctgg 480 

tgtttggaca ggctcagagg gtggacctga gctctaccca tgaggacctc tagctcccaa 540 

ccccacagcc tctccaagga cccaggcgcc agcagccccg ggacacaggg gactcagtgt 600 

gtgggacttg gtcactccat gtcagacaca cgagcagaga ggaacacaaa ccactgtgga 660 

gcctgaagct ccttaagaag agtccacaac agctggtggg agggtggggt gggcctgggt 720 

ccagaccagg ctcgctgctc tctgggcctc agtttcccca cctgccagcg ggctcggccc 780 

tgtcctcctc acaggctggt gtggccgtca gggtgggtgg ggttattgtt agtaggcgca 840 

gcctcattcc caccacgatc tgttccgcgt ggttcccgcc aaacctccct cggtcgccgt 900 

gttctccgca agcctcctgc agcgcccgcc tgccaatgtg aggctggcac caggctgcag 960 

cctccccaat cccagcccac tttgctgtgt ctctggcggg ctgtcctcct tggtgggagc 1020 

tgtcctgcac actgtaggat gcttaaaggt atccctggcc tccacccatc cctagccagc 1080 

agctcccagt cagacaacag ccagaaatgt ctccagactc tgcccagcct ccccaggtag 1140 

ccaccctcga gacatgacct cagagtctct gtgtctccta gaagcctgac agagaccccc 1200 

agggcagtgg gtgggtggcg ggctagagac ccttgcctgt gtccgggacc ctggcgccgc 1260 

tctcccctcc tgtggatccc tccgcactaa cagtgttctc agtgggcaga cgcctgggca 1320 

ccccttgggc cctgcccagc atggccatgg cgcaggctct cgaacccgca tggctttccc 1380 

aggcctggtg attctgctct ccagggacgg ttggcacctt cctcgggggc gggccccacg 1440 

caccccagaa cacacagacc cacctttctg gcgttctttc tacctccctt ttcgttgcct 1500 

gaggagctgg tggtttcatg agttaatgat acatcttgca aggtgtacac atagagaaaa 1560 

aaacctaaaa atgtggaaaa gcacgccaaa gccttattta aataataact attaaactat 1620 

tcaaaaagaa aaaaaaaaaa caaacaaaaa acataaaaac aaaaaaacaa aaagaaaaaa 1680 

aaaaaaaaca aaacaggggg gcgggcgcgc cccagaagaa atcccgcccg cggggggcgg 1740 

ccccaataac cgcgcgacca cccggcttcc ttgttagcac acaacagggg ggggcgcccc 1800 

acaagaacga gcgagcgcac cgtagtcata actatacgca gcgcgagctc cgatggccgc 1860 

gcccacacac ctcaaccaca cagtcgtggg tacaccctgc ggcaaccaaa cgctggccac 1920 

aacccggcgg aaacatcttg cgtggagaga aaggcgaccc gctcacacta ccccagcgag 1980 

agcgagggac ggcaacccaa acatcggtgt gggggccgcc gacaaacctc cctcactcct 2040 

caaccatgcg cgacgatctt cattaccaaa tcgcccccct cgctcgacgg gcggcgacga 2100 

atcgctcatc acacacaatc catcttattc acacggccgt cgcgctatcc tccacgcgtg 2160 

tgtggtgcct catcacaccg aacgacgggt cgcatcaata atatgtcgac ccggcccgcg 2220 

ctagctcgtc aggaaaatga agatttcgtg ctggctcaat cttaggaggc agaaattgca 2280 

ctcgatcgtt cgtcgctgac gaacacacat ttctttctcc 2320 

<210> 65 

<211> 1609 

<212> DNA 

<213> Homo sapien 



<400> 65 

agcggggaga aacgttgcag cccgcgccga acgccgggca gcacaaagga tccccgactg 60 

ccggggagcg gtgctcggag ggcacaggtc tacgccatcc cccacgcagt ttcggagatg 120 

gagcgctggg cccatggagg gaaggcggca ggctcggcgg ctccggcagc ttgctggggc 180 

aggggctcaa ggcggcagtc cgatagtgga ggccgctgag aactgtcacg gagctgcgtc 240 

tgtacagcga gcatccctta tttattcagg gcgagtgtgt atttggggcg gcgtgcaggg 300 

ggctgacaaa gaccggagag ctcccggtgc ggccgccggc ggagcgaaga ctggaacccg 360 

tatgagcgcc ccccagcgcc cctgagcgct cgccgccggt gcacggcgca ccccgcggga 420 

ggcagggatc agcaaagccg tgcgccccga ggcccgcccc cgtctccgca caaagaccga 480 

gctggaggat cttcagaaga agcctccccc atacctgcgg aacctgtcca gcgatgatgc 540 

caatgtcctg gtgtggcacg ctctcctcct acccgaccaa cctccctacc acctgaaagc 600 

cttcaacctg cgcatcagct tcccgccgga gtatccgttc aagcctccca tgatcaaatt 660 

cacaaccaag atctaccacc ccaacgtgga cgagaacgga cagatttgcc tgcccatcat 720 

cagcagtgag aactggaagc cttgcaccaa gacttgccaa gtcctggagg ccctcaatgt 780 

gctggtgaat agaccgaata tcagggagcc cctgcggatg gacctcgctg acctgctgac 840 

acagaatccg gagctgttca gaaagaatgc cgaagagttc accctccgat tcggagtgga 900 

ccggccctcc taactcatgt tctgaccctc tgtgcactgg atcctcggca tagcggacgg 960 

acacacctca tggactgagg ccagagcccc ctgtggccca ttccccattc atttttccct 1020 

tcttaggttg ttagtcatta gtttgtgtgt gtgtgtggtg gagggaaggg agctatgagt 1080 

gtgtgtgttg tgtatggact cactcccagg ttcacctggc cacaggtgca cccttcccac 1140 

accctttaca ttccccagag ccaagggagt ttaagtttgc agttacaggc cagttctcca 1200 

gctctccatc ttagagagac aggtcacctt gcaggcctgc ttgcaggaaa tgaatccagc 1260 

agccaactcg aatcccccta gggctcaggc actgagggcc tggggacagt ggagcatatg 1320 

ggtgggagac agatggaggg taccctattt acaactgagt cagccaagcc actgatggga 1380 

atatacagat ttaggtgcta aaccgtttat tttccacgga tgagtcacaa tctgaagaat 1440 

caaacttcca tcctgaaaat ctatatgttt caaaaccact tgccatcctg ttagattgcc 1500 

agttcctggg accaggcctc agactgtgaa gtatatatcc tccagcattc agtccagggg 1560 

gagccacgga aaccatgttc ttgcttaagc cattaaagtc agagatgaa 1609 

<210> 66 

<211> 1414 

<212> DNA 

<213> Homo sapien 



<400> 66 

tcctgcctga aacttgtgat aacaactggg tgcatctcca gtgcaggaaa agtccttggc 60 

atcttggtgc caggcattct gctactgctg gtggtgggtt atgaacacag atgaccagga 120 

tgaagtcctt gccttggaat aagcccatag cccatcccat aatttgatgc ctgtattact 180 




agagtcctga gaaacacgga atgctgatta gaatttggga caaagttccc tgagtattct 240 

ctggaggaaa agatgcggag gtggacctgg aaggcctcct ggagagctgg aggatcttca 300 

gaagaagcct cccccatacc tgcggaacct gtccagcgat gatgccaatg tcctggtgtg 360 

gcacgctctc ctcctacccg accaacctcc ctaccacctg aaagccttca acctgcgcat 420 

cagcttcccg ccggagtatc cgttcaagcc tcccatgatc aaattcacaa ccaagatcta 480 

ccaccccaac gtggacgaga acggacagat ttgcctgccc atcatcagca gtgagaactg 540 

gaagccttgc accaagactt gccaagtcct ggaggccctc aatgtgctgg tgaatagacc 600 

gaatatcagg gagcccctgc ggatggacct cgctgacctg ctgacacaga atccggagct 660 

gttcagaaag aatgccgaag agttcaccct ccgattcgga gtggaccggc cctcctaact 720 

catgttctga ccctctgtgc actggatcct cggcatagcg gacggacaca cctcatggac 780 

tgaggccaga gccccctgtg gcccattccc cattcatttt tcccttctta ggttgttagt 840 

cattagtttg tgtgtgtgtg tggtggaggg aagggagcta tgagtgtgtg tgttgtgtat 900 

ggactcactc ccaggttcac ctggccacag gtgcaccctt cccacaccct ttacattccc 960 

cagagccaag ggagtttaag tttgcagtta caggccagtt ctccagctct ccatcttaga 1020 

gagacaggtc accttgcagg cctgcttgca ggaaatgaat ccagcagcca actcgaatcc 1080 

ccctagggct caggcactga gggcctgggg acagtggagc atatgggtgg gagacagatg 1140 

gagggtaccc tatttacaac tgagtcagcc aagccactga tgggaatata cagatttagg 1200 

tgctaaaccg tttattttcc acggatgagt cacaatctga agaatcaaac ttccatcctg 1260 

aaaatctata tgtttcaaaa ccacttgcca tcctgttaga ttgccagttc ctgggaccag 1320 

gcctcagact gtgaagtata tatcctccag cattcagtcc agggggagcc acggaaacca 1380 

tgttcttgct taagccatta aagtcagaga tgaa 1414 

<210> 67 

<211> 1243 

<212> DNA 

<213> Homo sapien 



<400> 67 

agcggggaga aacgttgcag cccgcgccga acgccgggca gcacaaagga tccccgactg 60 

ccggggagcg gtgctcggag ggcacaggtc tacgccatcc cccacgcagt ttcggagatg 120 

gagcgctggg cccatggagg gaaggcggca ggggaacctg tccagcgatg atgccaatgt 180 

cctggtgtgg cacgctctcc tcctacccga ccaacctccc taccacctga aagccttcaa 240 

cctgcgcatc agcttcccgc cggagtatcc gttcaagcct cccatgatca aattcacaac 300 

caagatctac caccccaacg tggacgagaa cggacagatt tgcctgccca tcatcagcag 360 

tgagaactgg aagccttgca ccaagacttg ccaagtcctg gaggccctca atgtgctggt 420 

gaatagaccg aatatcaggg agcccctgcg gatggacctc gctgacctgc tgacacagaa 480 

tccggagctg ttcagaaaga atgccgaaga gttcaccctc cgattcggag tggaccggcc 540 

ctcctaactc atgttctgac cctctgtgca ctggatcctc ggcatagcgg acggacacac 600 

ctcatggact gaggccagag ccccctgtgg cccattcccc attcattttt cccttcttag 660 

gttgttagtc attagtttgt gtgtgtgtgt ggtggaggga agggagctat gagtgtgtgt 720 

gttgtgtatg gactcactcc caggttcacc tggccacagg tgcacccttc ccacaccctt 780 

tacattcccc agagccaagg gagtttaagt ttgcagttac aggccagttc tccagctctc 840 

catcttagag agacaggtca ccttgcaggc ctgcttgcag gaaatgaatc cagcagccaa 900 

ctcgaatccc cctagggctc aggcactgag ggcctgggga cagtggagca tatgggtggg 960 

agacagatgg agggtaccct atttacaact gagtcagcca agccactgat gggaatatac 1020 

agatttaggt gctaaaccgt ttattttcca cggatgagtc acaatctgaa gaatcaaact 1080 

tccatcctga aaatctatat gtttcaaaac cacttgccat cctgttagat tgccagttcc 1140 

tgggaccagg cctcagactg tgaagtatat atcctccagc attcagtcca gggggagcca 1200 

cggaaaccat gttcttgctt aagccattaa agtcagagat gaa 1243 

<210> 68 

<211> 1507 

<212> DNA 

<213> Homo sapien 

<400> 68 

gggggtgggg tccccggggc ggggcggggc gcgctgtgtc gcgggtcgga gctcggtcct 60 

gctggaggcc acgggtgcca cacactcggt cccgacatga tggcgagcat gcgagtggtg 120 

aaggagctgg aggatcttca gaagaagcct cccccatacc tgcggaacct gtccagcgat 180 

gatgccaatg tcctggtgtg gcacgctctc ctcctacccg aggcagaggt tgcggtgagc 240 

cgagatcacg ccattgcact ccagcctggg cagcaaagtg aaactccatc tcaaaaaaaa 300 

aaaaaaaaag aagcttggca ccagcatctc cttctgatga ggccctcagg aagcttccac 360 

tcatggtgga aggcaaaggg gagccacgtg tacagatcac attgcaagag aagaagtgaa 420 

agagagagag agcgagcagg tgccaggatc ttctaaacaa ccagcttttt cagaccaacc 480 

tccctaccac ctgaaagcct tcaacctgcg catcagcttc ccgccggagt atccgttcaa 540 

gcctcccatg atcaaattca caaccaagat ctaccacccc aacgtggacg agaacggaca 600 

gatttgcctg cccatcatca gcagtgagaa ctggaagcct tgcaccaaga cttgccaagt 660 

cctggaggcc ctcaatgtgc tggtgaatag accgaatatc agggagcccc tgcggatgga 720 

cctcgctgac ctgctgacac agaatccgga gctgttcaga aagaatgccg aagagttcac 780 

cctccgattc ggagtggacc ggccctccta actcatgttc tgaccctctg tgcactggat 840 

cctcggcata gcggacggac acacctcatg gactgaggcc agagccccct gtggcccatt 900 

ccccattcat ttttcccttc ttaggttgtt agtcattagt ttgtgtgtgt gtgtggtgga 960 

gggaagggag ctatgagtgt gtgtgttgtg tatggactca ctcccaggtt cacctggcca 1020 

caggtgcacc cttcccacac cctttacatt ccccagagcc aagggagttt aagtttgcag 1080 

ttacaggcca gttctccagc tctccatctt agagagacag gtcaccttgc aggcctgctt 1140 

gcaggaaatg aatccagcag ccaactcgaa tccccctagg gctcaggcac tgagggcctg 1200 

gggacagtgg agcatatggg tgggagacag atggagggta ccctatttac aactgagtca 1260 

gccaagccac tgatgggaat atacagattt aggtgctaaa ccgtttattt tccacggatg 1320 

agtcacaatc tgaagaatca aacttccatc ctgaaaatct atatgtttca aaaccacttg 1380 

ccatcctgtt agattgccag ttcctgggac caggcctcag actgtgaagt atatatcctc 1440 

cagcattcag tccaggggga gccacggaaa ccatgttctt gcttaagcca ttaaagtcag 1500 

agatgaa 1507 

<210> 69 

<211> 1279 

<212> DNA 

<213> Homo sapien 

<400> 69 

cagcggccgg ccgcggggcc tttttgtcct gagggccaga gaaatggaga agggggtggg 60 

gggacagcca cgtggccgca ggaggattta caacattttc tttcgccatc gatgttatcg 120 

caaaatgtgt gagagaagcg gctgcgcagc ccggacggga gcgtgagggt gcgggccagg 180 

taagcagccc cggcggtttc gccgcatacg ggactgcggg gcgaccgcgg gcaccagcca 240 

cgcgcagcgg ctccgcgggg tctcggccgg gtccgcgctc tgaaggatct cgagagccat 300 

ggatggtgca ggcgggaccc tcgagctgca gcatctccgg tgacccgggg ttgccgagga 360 

ggtggagacc agcacaggtg gtccggcccg ggcgcctccg aatccggggg tggtcaagac 420 

ggatccccaa ggctgaggtc ggcagtcccg gggactcgca gctgttgagc ctgtggagac 480 

gcggccccgt gaccgaggca cccttcagca acccgggggc agcgttccag agactgaact 540 

tctcaaatca ctgcttcaac agcttttaaa aatctggacc tagttactcc tgtcatctat 600 

gtgtaaagat ttagaaaaaa aatcccaaac ccaagggtgg gcagccgcga ggatgtacag 660 

ctcctggcag tgtgcgctca cccggagacc tgaggtcaaa gcccaaatgc tgagcacctc 720 

ctgggaggcg ggagcatggt ctgctcagtg ccagacatct ggatattgag aaaacatctg 780 

ggtagctggg gttttcaggt ttctgcctga catttaatat cacaaacatt gagcctgctc 840 

ctgtccccca ccagccacgc ggcttctcct cccaaacata cccagctcct tcccctgcct 900 

tcctgacggg taagaaacat ccattctttc caattcccca gcgttttccc cctaccggaa 960 

atctgatggg cttatgacat catggctggc tgctgagcga tgaagtggat gccacaaaga 1020 

aatccgacat atcagataga ttctgaaatc ggtttccctc cagctgtagt aacaggcgtg 1080 

aagtcaggag aatttgagct ttgtttaaaa aataaataaa taaataaata aaccataaca 1140 

aagtcttgcc ctgtattaaa tgcaattttc ttaaaaacaa gcaaaccttt tggacatcat 1200 

tttattttaa tagaaatgct gagttttatg aaactaaagt ggctaataaa tcagacctga 1260 

agctttgtgt gagtgttcc 1279 



<210> 70 

<211> 517 

<212> DNA 

<213> Homo sapien 

<400> 70 

cagcggccgg ccgcggggcc tttttgtcct gagggccaga gaaatggaga agggggtggg 60 

gggacagcca cgtggccgca ggaggattta caacattttc tttcgccatc gatgttatcg 120 

caaaatgtgt gagagaagcg gctgcgcagc ccggacggga gcgtgagggt gcgggccagc 180 

gttttccccc taccggaaat ctgatgggct tatgacatca tggctggctg ctgagcgatg 240 

aagtggatgc cacaaagaaa tccgacatat cagatagatt ctgaaatcgg tttccctcca 300 

gctgtagtaa caggcgtgaa gtcaggagaa tttgagcttt gtttaaaaaa taaataaata 360 

aataaataaa ccataacaaa gtcttgccct gtattaaatg caattttctt aaaaacaagc 420 

aaaccttttg gacatcattt tattttaata gaaatgctga gttttatgaa actaaagtgg 480 

ctaataaatc agacctgaag ctttgtgtga gtgttcc 517 



<210> 71 

<211> 736 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 

<222> (46)..(46) 

<223> n=a, c, g or t 



<400> 71 

gctccccaaa gggtacctgc ttctgctggc ttaatgcctc agaacnttgg tgtcattggt 60 

ctcagacacc actttgccat ccactatccg gcgggtggtg gtctttggga tggtttgcat 120 

ggagttgctg ctgtccaagg catcaccaag attaaagtcc tcgccatcta cagcttctgc 180 

tcccagatct gtgacccccg gacgaccggc gcgttctggc agacgtggaa ggactttgag 240 

gtccggcatg gcaatgagga caccatcaag gaaatgctgc gtatccggcg cagcgtgcag 300 

gccacgtaca acacgcaggt caacttcatg gcctcgcaga tgctcaaggt ctcgggcagt 360 

gccacgggca ccgtgtctga cctggcccct gggcagagtg gcatggacga catgaagctg 420 

ctggaacagc gggcagagca gctggcggct gaggcggagc gtgaccagcc cttgcgcgcc 480 

cagagcaaga tcctgttcgt gaggagtgac gcctcccggg aggagctggc agagctggca 540 

cagcaggtca accccgagga gatccagctg ggcgaggacg aggacgagga cgagatggac 600 

ctggagccca acgaggttcg gctggagcag cagagcgtgc cagccgcagt gtttgggagc 660 

ctgaaggaag actgacccgt ccctccccca tcccccctcc ccaccccctc cccaatacag 720 

ctacgtttgt acatca 736 

<210> 72 

<211> 3041 

<212> DNA 

<213> Homo sapien 

<400> 72 

aatgatagat cactataggg gaatgggtcc tctaatgctg ctcgagcggc gcagtgtgat 60 

ggatcgtggt cgcggcgagg taccagtatt ctccacagaa ccagcatgta gagcagcagc 120 

cacactacac ccacaaacca actctggaat acagtccttt tcccatacct ccccagtccc 180 

ccgcttatga accaaacctc tttgatggtc cagaatcaca gttttgccca aaccaaagct 240 

tagtttccct tcttggtgat caaagggaat ctgagaatat tgctaatccc atgcagactt 300 

cctccagtgt tcagcagcaa aatgatgctc acttgcacag cttcagcatg atgcccagca 360 

gcgcctgtga ggccatggtg gggcacgaga tggcctctga ctcttcaaac acttcactgc 420 

cattctcaaa catgggaaat ccaatgaaca ccacacagtt agggaaatca ctttttcagt 480 

ggcaggtgga gcaggaagaa agcaaattgg caaatatttc ccaagaccag tttctttcaa 540 

aggatgcaga tggtgacacg ttccttcata ttgctgttgc ccaagggaga agggcacttt 600 

cctatgttct tgcaagaaag atgaatgcac ttcacatgct ggatattaaa gagcacaatg 660 

gacagagtgc ctttcaggtg gcagtggctg ccaatcagca tctcattgtg caggatctgg 720 

tgaacatcgg ggcacaggtg aacaccacag actgctgggg aagaacacct ctgcatgtgt 780 

gtgctgagaa gggccactcc caggtgcttc aggcgattca gaagggagca gtgggaagta 840 

atcagtttgt ggatcttgag gcaactaact atgatggcct gactcccctt cactgtgcag 900 

tcatagccca caatgctgtg gtccatgaac tccagagaaa tcaacagcct cattcacctg 960 

aagttcagga gcttttactg aagaataaga gtctggttga taccattaag tgcctaattc 1020 

aaatgggagc agcggtggaa gcgaaggatc gcaaaagtgg ccgcacagcc ctgcatttgg 1080 

cagctgaaga agcaaatctg gaactcattc gcctcttttt ggagctgccc agttgcctgt 1140 

cttttgtgaa tgcaaaggct tacaatggca acactgccct ccatgttgct gccagcctgc 1200 

agtatcggtt gacacaatta gatgctgtcc gcctgttgat gaggaaggga gcagacccaa 1260 

gtactcggaa cttggagaac gaacagccag tgcatttggt tcccgatggc cctgtgggag 1320 

aacagatccg acgtatcctg aagggaaagt ccattcagca gagagctcca ccgtattagc 1380 

tccattagct tggagcctgg ctagcaacac tcactgtcag ttaggcagtc ctgatgtatc 1440 

tgtacataga ccatttgcct tatattggca aatgtaagtt gtttctatga aacaaacata 1500 

tttagttcac tattatatag tgggttatat taaaagaaaa gaagaaaaat atctaatttc 1560 

tcttggcaga tttgcatatt tcatacccag gtatctggga tctagacatc tgaatttgat 1620 

ctcaatggta acattgcctt caattaacag tagcttttga gtaggaaagg actttgattt 1680 

gtggcacaaa acattattaa tatagctatt gacagtttca aagcaggtaa attgtaaatg 1740 

tttctttaag aaaaagcatg tgaaaggaaa aaggtaaata cagcattgag gcttcatttg 1800 

gccttagtcc ctgggagtta ctggcgttgg acaggcttca gtcattggac tagatgaaag 1860 

gtgtccatgg ttagaatttg atctttgcaa actgtatata attgttattt ttgtccttaa 1920 

aaatattgta catacttggt tgttaacatg gtcatatttg aaatgtataa gtccataaaa 1980 

tagaaaagaa caagtgaatt gttgctattt aaaaaaattt tacaattctt actaaggagt 2040 

ttttattgtg taatcactaa gtctttgtag ataaagcaga tggggagtta cggagttgtt 2100 

cctttactgg ctgaaagata tattcgaatt gtaaagatgc tttttctcat gcattgaaat 2160 

tatacattat ttgtagggaa ttgcatgctt tttttttttt ttctcccgag acagggtctt 2220 

gctctggcgc ccaggctgga gtacagtggc atgatcttgg ctcacttcag ccttgacttg 2280 

ggctcaagtg atcctcctac ctgagccttc tgagtaactg ggactacagg tgtgcactcc 2340 

tcgcctggct aattttttat tttttgtaca ggcaggatct tgccaccttg cccaggctgg 2400 

tcttgaactc ctgagctcat gccatctgcc tgccttagtc tcccaaaatg ctgggattac 2460 

aggagtgagc caccatgccc ggctggcagt tgcatggaag agaacacctc tttatggctt 2520 

accctctaga atttctaatt tatgtgttct gttgaaattt ttgttttttt acctttattg 2580 

aaacaacaaa aagtcagtat tgaaacatat cttcctgttt tctgttgtca aatgatgata 2640 

atgtgccatg atgttttata tatatcattc agaaaaagtt ttatttttta ataacattct 2700 

attaacatta ttttgcttgc cgctggcatg cctgaggaat gtatttggct ttgattacac 2760 

actaagtttt tgtaataaat ttgactcatt aaaaaccttt tttttttaaa aaaaaaaaaa 2820 

agaaaatctc attagtgaac ttatctttgc agctgagtac ttaaattctt tttaaaaaga 2880 

taccctttgg attgatcaca ttgtttgacc cagtatgtct tgtagacacg ttagttataa 2940 

tcaccttgta tctctaaata tggtgtgata tgaaccagtc cattcacatt ggaaaaactg 3000 

atggttttaa ataaactaat tcactaatat tatttgtctt a 3041 

<210> 73 
<211> 1193 
<212> DNA 
<213> Homo sapien 

<400> 73 

acatccagct gcctgagacc ctcctgcagc cttctcaagg gacagcccca ctctgcctct 60 

tgctcctcca gggcagcacc atgcagcccc tgtggctctg ctgggcactc tgggtgttgc 120 

ccctggccag ccccggggcc gccctgaccg gggagcagct cctgggcagc ctgctgcggc 180 

agctgcagct caaagaggtg cccaccctgg acagggccga catggaggag ctggtcatcc 240 

ccacccacgt gagggcccag tacgtggccc tgctgcagcg cagccacggg gaccgctccc 300 

gcggaaagag gttcagccag agcttccgag aggtggccgg caggttcctg gcgttggagg 360 

ccagcacaca cctgctggtg ttcggcatgg agcagcggct gccgcccaac agcgagctgg 420 

tgcaggccgt gctgcggctc ttccaggaat gtacattgac ctgcagggga tgaagtgggc 480 

cgagaactgg gtgctggagc ccccgggctt cctggcttat gagtgtgtgg gcacctgccg 540 

gcagcccccg gaggccctgg ccttcaagtg gccgtttctg gggcctcgac agtgcatcgc 600 

ctcggagact gactcgctgc ccatgatcgt cagcatcaag gagggaggca ggaccaggcc 660 

ccaggtggtc agcctgccca acatgagggt gcagaagtgc agctgtgcct cggatggtgc 720 

gctcgtgcca aggaggctcc agccataggc gcctagtgta gccatcgagg gacttgactt 780 

gtgtgtgttt ctgaagtgtt cgagggtacc aggagagctg gcgatgactg aactgctgat 840 

ggacaaatgc tctgtgctct ctagtgagcc ctgaatttgc ttcctctgac aagttacctc 900 

acctaatttt tgcttctcag gaatgagaat ctttggccac tggagagccc ttgctcagtt 960 

ttctctattc ttattattca ctgcactata ttctaagcac ttacatgtgg agatactgta 1020 

acctgagggc agaaagccca atgtgtcatt gtttacttgt cctgtcactg gatctgggct 1080 

aaagtcctcc accaccactc tggacctaag acctggggtt aagtgtgggt tgtgcatccc 1140 

caatccagat aataaagact ttgtaaaaca tgaataaaac acattttatt cta 1193 

<210> 74 

<211> 1725 

<212> DNA 

<213> Homo sapien 

<400> 74 

ttttttttca cctaaattgt ttttacttgc ttcaccttaa atcagttatt cggttccttt 60 

gaagaacatg tagctttagc aacccccaaa gtgaacccag atacccagaa ctactgattg 120 

ttacatcaga gtttttgaaa acgccaatac tggaaacagg ccgacagtta aggagaggca 180 

ggcctcaaac taaagcagaa ccttctcaaa actacttata tttcaagaga tggattaagc 240 

gaaaagagct gtacaaagaa ataagccatt ttctatgcgc ctgttatatg caaggcactg 300 

cactcccttg tcacttgtta atttacaaga agactcgccg gtccagggaa gggagagggg 360 

aataggtgca aggtttgtgg agggcggcaa actgcactcc atgtggcgcg aggctgcgct 420 

tgcggagatc cgggttctgg tgcctggcgc cggcagccag taggggcgcg cgtcctgcgc 480 

agttgccgcg cgcggcctcc tcgggcttca ggccgggcca atcaggaggc gggcgggagg 540 

gggcgcggta ccaggaggga acgggacggg tcgggcttcg aggtcagctg gtccgcaggg 600 

aagcctttgg cttccccacc ggcaatggag cggttgacgt tgcctctcgg cggcgcggcg 660 

gcggtggacg agtacctgga gtaccggagg tataaacaac ataaaactga cttggaagcg 720 

attcctcagc agtgccccat tgatctgccc tgccaagtga ctggctgcca gtgcagggct 780 

tacctttatg tccccttgaa tggtagccag cccattcgct gcaggtgcaa acactttgct 840 

gatcagcaca gtgctgcgcc tggctttaca tgcaatacat gttccaagtg ttcaggattc 900 

catagctgct tcacttgtgc ttgtggtcag cctgcatatg cccatgacac agtagtggaa 960 

actaagcaag aaagattggc tcaggaaaaa ccagtgggac aggacattcc ttatgcagcc 1020 

atgggaggat taactggttt cagctcgctg gcggaaggct acatgcggtt agatgacagt 1080 

gggattgtag gtacaagtag tcaagtttct tcattaagga gacctgaaga ggatgatatg 1140 

gctttctttg aaagacgata ccaggaaagg atgaaaatgg aaaaggctgc taagtggaaa 1200 

ggaaaagctc cattgccatc agctacaaaa ccttcatgaa gactattgga gaaattaaaa 1260 

ccatcatcca agtatctttt tcatgtttat ttaaatgtaa taatacagtt tatttttcct 1320 

gaaattattt actttttttt tttactgtat aaatgtcttt tgggatgttt ccttaattta 1380 

tttaaataac taaaaatgtc tattactttt gtcaaaactc ataatttact actttgtatg 1440 

tacctttctt tctcctgaca aatgaggtta ttttatatga gtctgtctga gagtacagta 1500 

aatgttttta gtacataata atttaactgt ttcaggtatt taaaaaatta aagatattat 1560 

caagggtttt ggacaaacat atgagccatt tttttgtcat tcaaaatagt aagttaaaaa 1620 

caagagaaca aaagaaaaaa atagttaaaa tcattaattt ttttattttt caaactttgt 1680 

aatgttatgt tttcaaataa aaactatctc aaaattttac aacca 1725 
<210> 75

<211> 670 

<212> DNA 

<213> Homo sapien 

<400> 75 

ggaggttcag gaagccattt tctttcgcgt ctgcggtgct cggagtgtgg tacttctcct 60 

agttgcagtc aggcttcata cgctattgtc ctgcccgtta gagcagccag cgggtacaga 120

atggattttg gaagagggag tcaccactgg acctccaagg aagccacgtg cagacatcta 180 

caaccttcga tctcctgacg agtttattgt tggccaaaac caggctttga ttgaaccagg 240 

atgaatgcgg gtgttggaag tagaatatat atatacatat aaaattggtt gggagccacg 300 

tgtaccagtg tgtgttgatc ttggcttgat tcagtctgcc ttgtaacaga aactggcgat 360 

ggaatatgag aggagccctc tggaaagaaa aggacagacc ctgtgctttc atgaaagtga 420 

agatctggct gaaccagttc cacaaggtta ctgtatacat agcctgagtt taaaaggctg 480 

tgcccacttc aagaatgtca ttgttagact ttgaaatttc taactgccta cctgcataaa 540 

gaaaataaaa tcttttaaat caaaaaaaaa acaaaaacaa aaaaacttgt ggtggggggc 600 

cttgggcccc ggaaaaggtt ttttaaccct tccgttgggg ggggggggcc ccttggtttg 660

cggccccctg 670



<210> 76 

<211> 964 

<212> DNA 

<213> Homo sapien 

<400> 76 

ggaggttcag gaagccattt tctttcgcgt ctgcggtgct cggagtgtgg tacttctcct 60 

agttgcagtc aggcttcata cgctattgtc ctgcccgtta gagcagccag cgggtacaga 120 

atggattttg gaagagggag tcaccactgg acctccaagg aagccacgtg cagacatcta 180 

caaccttcga tctcctgacg agtttattgt tggccaaaac caggctttga ttgaaccagg 240 

atgaatgcgg gtgttggaag tagaatatat atatacatat aaaattgaaa ctggcgatgg 300

aatatgagag gagccctctg gaaagaaaag gacagaccct gtgctttcat gaaagtgaag 360 

atctggctga accagttcca caaggttact gtatacatag cctgagttta aaaggctgtg 420 

cccacttcaa gaatgtcatt gttagacttt gaaatttcta actgcctacc tgcataaaga 480 

aaataaaatc ttttaaatca aaaaaaaaaa aaaggggggg ccgccctttt tttttttttt 540 

tttgggaggt ccccaacaat tcttgggtaa ttctttcccc aaaaggtaaa tatacccgaa 600 

gggtttttaa agttccacaa aaaaaacctt atttgggttc cttccccaaa gaaaaaaaaa 660 

atacatgaga gcaacagccc agcccgatag acagcacaga gacaactcgc aaccaacaag 720 
tccgcccaac gacaagcaga aacaaaagga gggcggcgag ccaacacatc agcaaggcga 780
caagacccac agccaagagg caggcggccg acagtgacat aacaacgagc ggccccacgg 840 
caacgacaac cgacgataag aacgacgtgt gtgaagacac accccaccgc cgaacgacgg 900 
gatggcacca gcgagaccta tagtacgcga agcagccaag acgaagagcg gggaagcaag 960 

gaaa 964

<210> 77 

<211> 269 

<212> DNA 

<213> Homo sapien 

<400> 77

cctctctggc ggcggctgct gctagggagt cgacttctcc tgccttgtaa cagaaactgg 60

cgatggaata tgagaggagc cctctggaaa gaaaaggaca gaccctgtgc tttcatgaaa 120 

gtgaagatct ggctgaacca gttccacaag gttactgtat acatagcctg agtttaaaag 180 

gctgtgccca cttcaagaat gtcattgtta gactttgaaa tttctaactg cctacctgca 240 

taaagaaaat aaaatctttt aaatcaaaa 269

<210> 78 

<211> 928 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature
<222> (855)..(855)
<223> n=a, c, g or t

<400> 78

ggaggttcag gaagccattt tctttcgcgt ctgcggtgct cggagtgtgg tacttctcct 60 
agttgcagtc aggcttcata cgctattgtc ctgcccgtta gagcagccag cgggtacaga 120 
atggattttg gaagagggag tcaccactgg acctccaagg aagccacgtg cagacatcta 180 
caaccttcga tctcctgacg agtttattgt tggccaaaac caggctttga ttgaaccagg 240 
atgaatgcgg gtgttggaag tagaatatat atatacatat aaaattggtt gggagccacg 300
tgtaccagtg tgtgttgatc ttggcttgat tcagtctgcc ttgtaacaga aactggcgat 360 
ggaatatgag aggagccctc tggaaagaaa aggacagacc ctgtgctttc atgaaagtga 420 
agatctggct gaaccagttc cacaaggtta ctgtatacat agcctgagtt taaaaggctg 480
tgcccacttc aagaatgtca ttgttagact ttgaaatttc taactgccta cctgcataaa 540 
gaaaataaaa tcttttaaat caaaaaaaaa aaccaaaaaa aatatctata aacacaaaga 600
aaaacaaatt caaacttaca cattatccaa tcattctatt attataaata tatgaaaaaa 660
aaaaaaactt gggggcccgc ttggcccagg gggttttaaa tctggaccaa tttattcttg 720

gggcccaatt tggggctcat tttggccaat tgccccttta ggggaggcgt ttacaattca 780 

tgggccggtt tttcaaacgg tggtatttgg aaaacccctg gggttaccca atttttaaac 840 

ccttgggaaa aaatnccctt ttttcacagt ggggggaaat tccgaaggcc ccgccccgtg 900 

aggccttttc catttggccc ctctttgg 928

<210> 79 

<211> 2448 

<212> DNA 

<213> Homo sapien 



<400> 79

tttttttttt tcactagtag cattgttaat attaaacaag atagctaata gctttaaagc 60

gtctaatcac ggtaccccca cgagtagtag tcacacagca aatcataggc aggagtggac 120 

aaacagcgct gggagcatta attctggtag atttggagtc tgggcagggc ctaagaggat 180

gggatcagcg gattgggcca gccaaagggg gcgggttcag agggaggagc cagctgcctc 240 

ccgctgcaag ccccgagctt tgaatcaagc gcggtagttt cagctgtagc caatcctaga 300 

acaaaatctt tttgcgtcag ttgttgctga ggcctagatc tcacccagat cagttctagt 360 

cgctgctgct tgtacccgtc ctcggccctt gcgtgtccgt ccgttacgac tcagaggacg 420 

gagagaaagt ctaagtcacc ttcctcagta tccaggccgg aaggagactc tcagctactc 480

gccctgaaca tgcagagcag cactggcttc cctgtctctg gatgccgcag gggcgtcctg 540 

tcaggagttg tagtgtcaag attccagttc ttcttatgcg acttgctgga gaagcaggaa 600 

gttgctgttt attggagaag tcgggctagg gagggaaaca atcccgcgga gccatatgga 660 

gagggcagga ggtgtctaga tagtggtagg ccggcggggg acggagggta tgggtggtga 720 

gtcctggccc tttaaacccg agggggagga tccggaagtg gatgggcggg gcgaagggtc 780 

ctcgtatata aaagggcccc ggccgcgcgg ggtctctaat ctgccatttt ctgtccctga 840 

gtgagtctct ggcgtcccaa attgcctgtt tttctcgcag gctctattcc gttcgctggt 900

tcgccacctc aggggaacga tggccatgga gtccacagcc actgccgccg tcgccgcgga 960

gctggtttct gccgacaaaa ttgaagatgt tcctgctcct tctacatctg cagataaagt 1020

ggagagcaac cattaatctg ctttctgtca ctgaagatgg gttgcatttt gtagaatatt 1080

atctaaatag aatcatacat ctggatgtgg atagtgaagc taagaaacta ttgggtttag 1140

gacagaaaca tctggtgatg ggggatattc cagcagctgt caatgcattc caggaagcag 1200

ctagtctttt aggtaagaag tatggagaga cagctaatga gtgtggagaa gccttctttt 1260

tctatgggaa atcacttctg gagttggcaa gaatggagaa tggtgtgttg ggaaacgcct 1320
tggaaggtgt gcatgtggaa gaggaagaag gagaaaaaac agaagatgaa tctctggtag 1380

aaaataatga taacatagat gaaactgaag gctcggaaga ggatgataaa gaaaatgata 1440

agactgaaga aatgccaaat gattcagtcc ttgaaaacaa gtctcttcaa gaaaatgagg 1500

aggaggagat tgggaaccta gagcttgcct gggatatgct ggatttagca aagatcattt 1560

ttaaaaggca agaaacaaaa gaagcacagc tttatgctgc ccaggcacat cttaaactcg 1620

gagaagttag tgttgaatct gaaaactatg tgcaagctgt ggaggagttc cagtcctgcc 1680

ttaacctgca ggaacagtac ctggaagccc acgaccgtct ccttgcagag acccactacc 1740

agctgggctt ggcttatggg tacaactctc agtatgatga ggcagtggca cagttcagca 1800

aatctattga agtcattgag aacagaatgg ctgtactaaa cgagcaggtg aaggaggctg 1860

aaggatcgtc tgctgaatac aagaaagaaa ttgaggaact aaaggaactg ctacccgaaa 1920

ttagagagaa gatagaagat gcaaaggagt ctcagcgtag tgggaatgta gctgaactgg 1980

ctctgaaagc tactctggtg gagagttcta cttcaggttt cactcctggt ggaggaggct 2040

cttcagtctc catgattgcc agtagaaagc caacagacgg tgcttcctca tcaaattgtg 2100

tgactgatat ttcccacctt gtcagaaaga agaggaaacc agaggaagag agtccccgga 2160

aagatgatgc aaagaaagcc aaacaagagc cggaggtgaa cggaggcagt ggggatgctg 2220

tccccagtgg aaatgaagtt tcggaaaaca tggaggagga ggctgagaat caggctgaaa 2280

gccgggcagc agtggagggg acagtggagg ctggagctac agttgaaagc actgcatgtt 2340

aagagggggc acagccctcc tcccaaggga aagtgttttt gtatataatg tattttttca 2400

cttttggagg attctttttg tataacttca ataaagattg taagcaaa 2448

<210> 80

<211> 1648

<212> DNA

<213> Homo sapien

<400> 80

tgacgcgaat gacgctgcgc cagtcaggcc gcagccccgc tgcttggccc gtcgggcccg 60
ccttggccgg ctcggcccgc ccccggccct ccctgcacgg cctcccgtgc gcccctgtca 120
gactgtggcg gccggtcgcg cggtgcgctc tccctccctg cccgcagcct ggagaggcgc 180
ttcgtgctgc acacccccgc gttcctgccg gcaccgcgcc tgccctctgc cgcgctccgc 240
cctgccgccg accgcacgcc cgccgcggga catggcacac gcaccggcac gctgccccag 300
cgcccggggc tccggggacg gcgagatggg caagcccagg aacgtggcgc tcatcaccgg 360
tatcacaggc caggatggtt cctacctggc tgagttcctg ctggagaaag gctatgaggt 420
ccatggaatt gtacggcggt ccagttcatt taatacgggt cgaattgagc atctgtataa 480
gaatccccag gctcacattg aaggaaacat gaagttgcac tatggcgatc tcactgacag 540
tacctgcctt gtgaagatca ttaatgaagt aaagcccaca gagatctaca accttggagc 600
ccagagccac gtcaaaattt cctttgacct cgctgagtac actgcggacg ttgacggagt 660
tggcactcta cgacttctag atgcagttaa gacttgtggc cttatcaact ctgtgaagtt 720
ctaccaagcc tcaacaagtg aactttatgg gaaagtgcag gaaatacccc agaaggagac 780
cacccctttc tatccccggt caccctatgg agctaatttc gttactcgaa aaattagccg 840
gtcagtagct aagatttacc ttggacaact ggaatgtttc agtttgggaa atctggatgc 900
caaacgagat tggggccatg ccaaggacta tgtggaggct atgtggttga tgttgcagaa 960
tgatgagccg gaggacttcg ttatagctac tggggaggtc catagtgtcc gggaatttgt 1020
cgagaaatca ttcttgcaca ttggaaaaac cattgtgtgg gaaggaaaga atgaaaatga 1080
agtgggcaga tgtaaagaga ccggcaaagt tcacgtgact gtggatctca agtactaccg 1140
gccaactgaa gtggactttc tgcagggcga ctgcaccaaa gcgaaacaga agctgaactg 1200
gaagccccgg gtcgctttcg atgagctggt gagggagatg gtgcacgccg acgtggagct 1260
catgaggaca aaccccaatg cctgagcagc gcctcggagc ccggcccgcc ctccggctac 1320
aatccccgca gagtctccgg tgcagacgcg ctgcggggat ggggagcggc gtgccaatct 1380
gcgggtcccc tgcggcccct gctgccgctg cgctgtcccg gccgcaagag cggggccgcc 1440
ccgccgaggt ttgtagcagc cgggatgtga ccctccaggg tttgggtcgc tttgcgtttg 1500
tcgaagcctc ctctgaatgg ctttgtgaaa tcaagatgtt ttaatcacat tcactttact 1560
tgaaattatg ttgttacaca acaaattgtg gggccttcaa attgtttttc tcttttcata 1620
ttaaaaatgg tctttctgtg aactagca 1648



<210> 81 

<211> 2595 

<212> DNA 

<213> Homo sapien 



<400> 81

tgacgcgaat gacgctgcgc cagtcaggcc gcagccccgc tgcttggccc gtcgggcccg 60

ccttggccgg ctcggcccgc ccccggccct ccctgcacgg cctcccgtgc gcccctgtca 120

gactgtggcg gccggtcgcg cggtgcgctc tccctccctg cccgcagcct ggagaggcgc 180

ttcgtgctgc acacccccgc gttcctgccg gcaccgcgcc tgccctctgc cgcgctccgc 240

cctgccgccg accgcacgcc cgccgcggga catggcacac gcaccggcac gctgccccag 300

cgcccggggc tccggggacg gcgagatggg caagcccagg aacgtggcgc tcatcaccgg 360

tatcacaggc caggatggtt cctacctggc tgagttcctg ctggagaaag gctatgaggt 420

ccatggaatt gtacggcggt ccagttcatt taatacgggt cgaattgagc atctgtataa 480

gaatccccag gctcacattg aaggaaacat gaagttgcac tatggcgatc tcactgacag 540

tacctgcctt gtgaagatca ttaatgaagt aaagcccaca gagatctaca accttggagc 600 

ccagagccac gtcaaaattt cctttgacct cgctgagtac actgcggacg ttgacggagt 660 

tggcactcta cgacttctag atgcagttaa gacttgtggc cttatcaact ctgtgaagtt 720 

ctaccaagcc tcaacaagtg aactttatgg gaaagtgcag gaaatacccc agaaggagac 780 

cacccctttc tatccccggt caccctatgg ggcagcaaaa ctctatgcct attggattgt 840

ggtgaacttc cgtgaggcgt ataatctctt tgcagtgaac ggcattctct tcaatcatga 900 

gagtcccaga agaggagcta atttcgttac tcgaaaaatt agccggtcag tagctaagat 960 

ttaccttgga caactggaat gtttcagttt gggaaatctg gatgccaaac gagattgggg 1020 

ccatgccaag gactatgtgg aggctatgtg gttgatgttg cagaatgatg agccggagga 1080 

cttcgttata gctactgggg aggtccatag tgtccgggaa tttgtcgaga aatcattctt 1140 

gcacattgga aaaaccattg tgtgggaagg aaagaatgaa aatgaagtgg gcagatgtaa 1200 

agagaccggc aaagttcacg tgactgtgga tctcaagtac taccggccaa ctgaagtgga 1260 

gaccaattaa ttgttcaaag gtaatatata gagatctcaa gagcatttac tttggcacag 1320

tctctgcccc tttgtggtct ggctaacaat aaagctaaga tttgttttgt aatgacggtt 1380

ccagttggac tggaattcct ttgaagttcc tgaagtttgc tgttgtgaga gctcaagata 1440 

ttatttgact gcaatgaaac actctctgtg tattatcaaa atatcgtata cactgaactg 1500 

taacaacttg cagtaatcag acttctcctt cgaggtggcc tgcgtgagta gagagcatcc 1560 

acagagagat gagcttctcc caacatccat ttagtgttca gtcatcacgg gccactcatg 1620 

gatctgaacc ttcttacaag gtcgatacca acaggtatga ggagcagaga gcagatcagc 1680 

acggagcagc attagctggg gcctccttgc ctggcctggc agtgaagata gagcaccatg 1740 

ccagttgttc agcttctcag cgcttgcatc agctgaccta aattatttaa ggatgttgtc 1800 

acctctaaat aaatatagtc attgtcctgg tctctattta tattttaaaa ctctttccca 1860 

gcccaccccg atggcctaaa acaattcata tacttcagta gaagagggtg tcacagggct 1920 

atttgcacag gtttggatcc agtgccagcg ttagccttca cacaagttac actcctctga 1980 

gatgtaactg tctcctgtgt atgagtgcca aagtatttga taaccgccat ctgtctgctc 2040 

tcaacacagg tttcacaagc acagtgatgt cttccattgt gtgcgtaaca actgtgtgtg 2100 

tgcggagaga gaggactctg tacgcatgca catgtttgga gctacataaa tctgagaagc 2160 

agttttgtct agtgtcgttt cagtaaatgg gactcattta gtcagtttgc tttgctgtca 2220
gaatcatatt ccagatatca gaatgtgcta tggcaggtac ttgtttaaaa caagtaataa 2280

tatccttgta atgtttagca atacagctga ctggtctgtg ttccattact tgtgtaaaag 2340

gaaatatgta aagcttttaa aaataagttt aaatgatcaa aagagatgcc tgaaaatata 2400

ttttggcaca ggaaattaca ggaatttata aatgtgtcat gtggaaagat aattacaaag 2460

tttattttaa tcttttatct ctgactgaat cttaagttaa aatattcttc ccggcatcct 2520

gtctggtttt tgccactgtc cctgctttac gtgttacctt tccatgttga tctgagttgc 2580

atttagttga ggaga 2595


<210> 82 
<211> 1706 
<212> DNA 
<213> Homo sapien 










<400> 82

tgacgcgaat gacgctgcgc cagtcaggcc gcagccccgc tgcttggccc gtcgggcccg 60

ccttggccgg ctcggcccgc ccccggccct ccctgcacgg cctcccgtgc gcccctgtca 120

gactgtggcg gccggtcgcg cggtgcgctc tccctccctg cccgcagcct ggagaggcgc 180

ttcgtgctgc acacccccgc gttcctgccg gcaccgcgcc tgccctctgc cgcgctccgc 240

cctgccgccg accgcacgcc cgccgcggga catggcacac gcaccggcac gctgccccag 300

cgcccggggc tccggggacg gcgagatggg caagcccagg aacgtggcgc tcatcaccgg 360

tatcacaggc caggatggtt cctacctggc tgagttcctg ctggagaaag gctatgaggt 420

ccatggaatt gtacggcggt ccagttcatt taatacgggt cgaattgagc atctgtataa 480

gaatccccag gctcacattg aaggaaacat gaagttgcac tatggcgatc tcactgacag 540

tacctgcctt gtgaagatca ttaatgaagt aaagcccaca gagatctaca accttggagc 600

ccagagccac gtcaaaattt cctttgacct cgctgagtac actgcggacg ttgacggagt 660

tggcactcta cgacttctag atgcagttaa gacttgtggc cttatcaact ctgtgaagtt 720

ctaccaagcc tcaacaagtg aactttatgg gaaagtgcag gaaatacccc agaaggagac 780

cacccctttc tatccccggt caccctatgg ggcagcaaaa ctctatgcct attggattgt 840

ggtgaacttc cgtgaggcgt ataatctctt tgcagtgaac ggcattctct tcaatcatga 900

gagtcccaga agaggagcta atttcgttac tcgaaaaatt agccggtcag tagctaagat 960

ttaccttgga caactggaat gtttcagttt gggaaatctg gatgccaaac gagattgggg 1020

ccatgccaag gactatgtgg aggctatgtg gttgatgttg cagaatgatg agccggagga 1080

cttcgttata gctactgggg aggtccatag tgtccgggaa tttgtcgaga aatcattctt 1140

gcacattgga aaaaccattg tgtgggaagg aaagaatgaa aatgaagtgg gcagatgtaa 1200

agagaccggc aaagttcacg tgactgtgga tctcaagtac taccggccaa ctgaagtggt 1260

aaggactctc tggccaccca gtgcgtggcc acgtttggct ggctggctgg gcaaatgcgc 1320 

tcatggaatg cctggtgcca gcctttggag ctgtcagttc agctccctgg catccttcag 1380 

cgtgcatttc caaaatcaga acacagtgaa ttcaatttga gattatcatt cttaccacaa 1440 

acttcattgc tctataaagg aaaacaagag ttaaacccac caaaagaaat cattgatttc 1500 

agaaatggaa tctcacatca cacgcagatc tcctcatgtt ggtctcctat tctaatggct 1560 

ggtccctgat aacctacagg atgcctgact taacaacgct gtcccagacc tgcaggacat 1620

ctaggtcctc tccagctatg tctatgtctg tgtctgtgta ctcatacaca gtgcatactt 1680 

gcacactata ccataaatgg tctccc 1706 

<210> 83 

<211> 2667 

<212> DNA 

<213> Homo sapien 

<400> 83 

ttggtttttt tttttttttt gcttttttgt tttttgagac agggtcttgc tctgttgccc 60 

aggctggggt gcagtggcga gatctcagtt cactgcaacc tccacatccc ggctttaacc 120 

gattctccca tctcagcctc acgagtagct gggattacag ggctgcatca acacacctgg 180 

ctaatttttg tatttttagt acagatgggg tttcaccatg ttggccaggc tggtctcaaa 240

ctcctgacct caggtgatct gccagcttca gcctcccaaa gtgctgggat tacaggcgtg 300 

agctaccacg cctggccttg gccatgtaga tttctttcac accttcctta ttccttcttc 360 

agaagctgag gccccatgag gggaagctgg tagccagggg tcttttcagg tacctgtggc 420 

caaattaggc ccagatgtta atgctctcat ccatggctga aaaaggccaa gaaactaaga 480 

agagctgtgg acgcacagac tggcagagac accgagaggg cactgcgacc gcactgcaca 540 

gatgagcaaa ggaaggccaa cttcagcatt ccctcctcct ctctaagccc ctccaaaagg 600 

cctgctcagg gagctctggc caccgttctc aagcaattgc ctgacgtttt tactgacact 660 

gagggctgga gatggtccct tttccaggca gctatgtgtc catttcctag agagggaaac 720 

tgagtcccag agaaggcggg tgtggctgga ggccatgggt cagtcatagg cataaccatg 780 

ccgtgaataa ccgtcttggc actctctgac ttctttccct ggcaatacaa cggatcccag 840 

tgcccatttg tcacataagg aaactgaggc tggcgtcttc gatgaagcct ggcaggagtg 900 

ggggtcacac acggagtgca aggctgtcca agccttcaag gtcattctga ctgatctttg 960

agccttagca atacaggacg tgggtccaag agcatttcca gcacaaaatg gcgcctcata 1020 

gaaaaatgta gtcggagttc cagcgcccac ctccaccacg gctcaaagga gggggcgcgg 1080 
gcggcagcgc acacttcatt tgcatataca aaaatcaacg gatttcgttg gctggcagcg 1140

gtttgatctg cacgaaactg ggtcgccctc tgaccggtct ctatcagtca ttggcccatg 1200 

cgagtctcgt gacccaccag tcactaaagc gcatccctat tggcccgagc gccttccatc 1260 

acgcgtttaa aaccccgagt cccgcctccc ctcgccacgc tccggccccc aggaagaccg 1320 

ttagtcgcgt gcgcgacttc ctcagcgtca gggcgcgccc gcgcgtatgc cgggccgtgg 1380 

cggcgtctca ttcaaacggt ccaatcagcg gccgcgtcgc cttccccgtc ccccttcctc 1440 

tcttcttccc cttttcagcc cctccgttct ttgagaaccc tcccgacccc cgcgggcccc 1500 

cattggccgg cccgctaccc agccctctcc gccacttccc tcgcttctga ccatagtttg 1560 

cggggaaggg agcgagcgcg tcgaaaacca aggaacgtgc gcgctgacgt cacggttgag 1620 

gctcggagct gaggggccgc ggagggcgtg gcctgcgggc ggttataaag aggcagtggt 1680 

gcgcgcgcgg ccggctcagt gctgccgggc accggggcgg cgggttggtc tacgctgtgc 1740

gcggcggacg tcggaggcag cggggagcgg agcggggccg ccggggcctc tccagggccg 1800 

cagcggcagc agttgggccc cccgccccgg ccggcggacc gaagaacgca ggaagggggc 1860 

cggggggacc cgcccccggc cggccgcagc catgaactcc aacgtggaga acctaccccc 1920 

gcacatcatc cgcctggtgt acaaggaggt gacgacactg accgcagacc cacccgatgg 1980 

catcaaggtc tttcccaacg aggaggacct caccgacctc caggtcacca tcgagggccc 2040
tgaggggacc ccatatgctg gaggtctgtt ccgcatgaaa ctcctgctgg ggaaggactt 2100
ccctgcctcc ccacccaagg gctacttcct gaccaagatc ttccacccga acgtgggcgc 2160
caatggcgag atctgcgtca acgtgctcaa gagggactgg acggctgagc tgggcatccg 2220
acacgtactg ctgaccatca agtgcctgct gatccaccct aaccccgagt ctgcactcaa 2280
cgaggaggcg ggccgcctgc tcttggagaa ctacgaggag tatgcagctc gggcccgtct 2340
gctcacagag atccacgggg gcgccggcgg gcccagcggc agggccgaag ccggtcgggc 2400
cctggccagt ggcactgaag cttcctccac cgaccctggg gccccgggcc ggagggctga 2460
agtccattgg ccagagcatg tgggaaggga aagacgttgg ggcagaaaga cgacagacgg 2520
cgcgcgcgtg aaggtctttc tctcacgtga ccattctgct cccaatttta gtaattgcgg 2580
cccgagcggg cgccgtgtca atgcccaaac aaaaaaacct tctcgaaaag gtgtcctcag 2640
tgctgctttt caggcgtcgc ttcttca 2667 

<210> 84 

<211> 2648 

<212> DNA 

<213> Homo sapien 

<400> 84 
ttggtttttt tttttttttt gcttttttgt tttttgagac agggtcttgc tctgttgccc 60

aggctggggt gcagtggcga gatctcagtt cactgcaacc tccacatccc ggctttaacc 120 

gattctccca tctcagcctc acgagtagct gggattacag ggctgcatca acacacctgg 180 

ctaatttttg tatttttagt acagatgggg tttcaccatg ttggccaggc tggtctcaaa 240 

ctcctgacct caggtgatct gccagcttca gcctcccaaa gtgctgggat tacaggcgtg 300

agctaccacg cctggccttg gccatgtaga tttctttcac accttcctta ttccttcttc 360 

agaagctgag gccccatgag gggaagctgg tagccagggg tcttttcagg tacctgtggc 420

caaattaggc ccagatgtta atgctctcat ccatggctga aaaaggccaa gaaactaaga 480 

agagctgtgg acgcacagac tggcagagac accgagaggg cactgcgacc gcactgcaca 540 

gatgagcaaa ggaaggccaa cttcagcatt ccctcctcct ctctaagccc ctccaaaagg 600 

cctgctcagg gagctctggc caccgttctc aagcaattgc ctgacgtttt tactgacact 660 

gagggctgga gatggtccct tttccaggca gctatgtgtc catttcctag agagggaaac 720 

tgagtcccag agaaggcggg tgtggctgga ggccatgggt cagtcatagg cataaccatg 780 

ccgtgaataa ccgtcttggc actctctgac ttctttccct ggcaatacaa cggatcccag 840 

tgcccatttg tcacataagg aaactgaggc tggcgtcttc gatgaagcct ggcaggagtg 900 

ggggtcacac acggagtgca aggctgtcca agccttcaag gtcattctga ctgatctttg 960 

agccttagca atacaggacg tgggtccaag agcatttcca gcacaaaatg gcgcctcata 1020

gaaaaatgta gtcggagttc cagcgcccac ctccaccacg gctcaaagga gggggcgcgg 1080 

gcggcagcgc acacttcatt tgcatataca aaaatcaacg gatttcgttg gctggcagcg 1140 

gtttgatctg cacgaaactg ggtcgccctc tgaccggtct ctatcagtca ttggcccatg 1200 

cgagtctcgt gacccaccag tcactaaagc gcatccctat tggcccgagc gccttccatc 1260 

acgcgtttaa aaccccgagt cccgcctccc ctcgccacgc tccggccccc aggaagaccg 1320 

ttagtcgcgt gcgcgacttc ctcagcgtca gggcgcgccc gcgcgtatgc cgggccgtgg 1380 

cggcgtctca ttcaaacggt ccaatcagcg gccgcgtcgc cttccccgtc ccccttcctc 1440 

tcttcttccc cttttcagcc cctccgttct ttgagaaccc tcccgacccc cgcgggcccc 1500 

cattggccgg cccgctaccc agccctctcc gccacttccc tcgcttctga ccatagtttg 1560 

cggggaaggg agcgagcgcg tcgaaaacca aggaacgtgc gcgctgacgt cacggttgag 1620 

gctcggagct gaggggccgc ggagggcgtg gcctgcgggc ggttataaag aggcagtggt 1680 

gcgcgcgcgg ccggctcagt gctgccgggc accggggcgg cgggttggtc tacgctgtgc 1740 

gcggcggacg tcggaggcag cggggagcgg agcggggccg ccggggcctc tccagggccg 1800 

cagcggcagc agttgggccc cccgccccgg ccggcggacc gaagaaccta cccccgcaca 1860 
tcatccgcct ggtgtacaag gaggtgacga cactgaccgc agacccaccc gatggcatca 1920

aggtctttcc caacgaggag gacctcaccg acctccaggt caccatcgag ggccctgagg 1980

ggaccccata tgctggaggt ctgttccgca tgaaactcct gctggggaag gacttccctg 2040

cctccccacc caagggctac ttcctgacca agatcttcca cccgaacgtg ggcgccaatg 2100

gcgagatctg cgtcaacgtg ctcaagaggg actggacggc tgagctgggc atccgacacg 2160

tactgctgac catcaagtgc ctgctgatcc accctaaccc cgagtctgca ctcaacgagg 2220

aggcgggccg cctgctcttg gagaactacg aggagtatgc agctcgggcc cgtctgctca 2280

cagagatcca cgggggcgcc ggcgggccca gcggcagggc cgaagccggt cgggccctgg 2340

ccagtggcac tgaagcttcc tccaccgacc ctggggcccc agggggcccg ggaggggctg 2400

agggtcccat ggccaagaag catgctggcg agcgcgataa gaagctggcg gccaagaaaa 2460

agacggacaa gaagcgggcg ctgcggcggc tgtagtgggc tctcttcctc cttccaccgt 2520

gaccccaacc tctcctgtcc cctccctcca actctgtctc taagttattt aaattatggc 2580

tggggtcggg gagggtacag ggggcactgg gacctggatt tgtttttcta aataaagttg 2640

gaaaagca 2648



<210> 85 

<211> 2800 

<212> DNA 

<213> Homo sapien 

<400> 85 

ttggtttttt tttttttttt gcttttttgt tttttgagac agggtcttgc tctgttgccc 60

aggctggggt gcagtggcga gatctcagtt cactgcaacc tccacatccc ggctttaacc 120

gattctccca tctcagcctc acgagtagct gggattacag ggctgcatca acacacctgg 180

ctaatttttg tatttttagt acagatgggg tttcaccatg ttggccaggc tggtctcaaa 240

ctcctgacct caggtgatct gccagcttca gcctcccaaa gtgctgggat tacaggcgtg 300

agctaccacg cctggccttg gccatgtaga tttctttcac accttcctta ttccttcttc 360

agaagctgag gccccatgag gggaagctgg tagccagggg tcttttcagg tacctgtggc 420

caaattaggc ccagatgtta atgctctcat ccatggctga aaaaggccaa gaaactaaga 480

agagctgtgg acgcacagac tggcagagac accgagaggg cactgcgacc gcactgcaca 540

gatgagcaaa ggaaggccaa cttcagcatt ccctcctcct ctctaagccc ctccaaaagg 600

cctgctcagg gagctctggc caccgttctc aagcaattgc ctgacgtttt tactgacact 660

gagggctgga gatggtccct tttccaggca gctatgtgtc catttcctag agagggaaac 720

tgagtcccag agaaggcggg tgtggctgga ggccatgggt cagtcatagg cataaccatg 780

ccgtgaataa ccgtcttggc actctctgac ttctttccct ggcaatacaa cggatcccag 840

tgcccatttg tcacataagg aaactgaggc tggcgtcttc gatgaagcct ggcaggagtg 900 

ggggtcacac acggagtgca aggctgtcca agccttcaag gtcattctga ctgatctttg 960 

agccttagca atacaggacg tgggtccaag agcatttcca gcacaaaatg gcgcctcata 1020 

gaaaaatgta gtcggagttc cagcgcccac ctccaccacg gctcaaagga gggggcgcgg 1080 

gcggcagcgc acacttcatt tgcatataca aaaatcaacg gatttcgttg gctggcagcg 1140 

gtttgatctg cacgaaactg ggtcgccctc tgaccggtct ctatcagtca ttggcccatg 1200 

cgagtctcgt gacccaccag tcactaaagc gcatccctat tggcccgagc gccttccatc 1260 

acgcgtttaa aaccccgagt cccgcctccc ctcgccacgc tccggccccc aggaagaccg 1320 

ttagtcgcgt gcgcgacttc ctcagcgtca gggcgcgccc gcgcgtatgc cgggccgtgg 1380 

cggcgtctca ttcaaacggt ccaatcagcg gccgcgtcgc cttccccgtc ccccttcctc 1440 

tcttcttccc cttttcagcc cctccgttct ttgagaaccc tcccgacccc cgcgggcccc 1500 

cattggccgg cccgctaccc agccctctcc gccacttccc tcgcttctga ccatagtttg 1560 

cggggaaggg agcgagcgcg tcgaaaacca aggaacgtgc gcgctgacgt cacggttgag 1620 

gctcggagct gaggggccgc ggagggcgtg gcctgcgggc ggttataaag aggcagtggt 1680 

gcgcgcgcgg ccggctcagt gctgccgggc accggggcgg cgggttggtc tacgctgtgc 1740 

gcggcggacg tcggaggcag cggggagcgg agcggggccg ccggggcctc tccagggccg 1800 

cagcggcagc agttgggccc cccgccccgg ccggcggacc gaagaacgca ggaagggggc 1860 

cggggggacc cgcccccggc cggccgcagc catgaactcc aacgtggaga acctaccccc 1920 

gcacatcatc cgcctggtgt acaaggaggt gacgacactg accgcagacc cacccgatgg 1980

catcaaggtc tttcccaacg aggaggacct caccgacctc caggtcacca tcgagggccc 2040 

tgaggggacc ccatatgctg gaggtctgtt ccgcatgaaa ctcctgctgg ggaaggactt 2100 

ccctgcctcc ccacccaagg gctacttcct gaccaagatc ttccacccga acgtgggcgc 2160 

caatggcgag atctgcgtca acgtgctcaa gagggactgg acggctgagc tgggcatccg 2220 

acacgtactg ctgctttcct ggaaagacaa acagtgccag acccaggaca cccaagtact 2280 

gttgagaagt gcccaggagc accttactat gcaacgagtg accatcaagt gcctgctgat 2340 

ccaccctaac cccgagtctg cactcaacga ggaggcgggc cgcctgctct tggagaacta 2400 

cgaggagtat gcagctcggg cccgtctgct cacagagatc cacgggggcg ccggcgggcc 2460 

cagcggcagg gccgaagccg gtcgggccct ggccagtggc actgaagctt cctccaccga 2520 

ccctggggcc ccagggggcc cgggaggggc tgagggtccc atggccaaga agcatgctgg 2580 
cgagcgcgat aagaagctgg cggccaagaa aaagacggac aagaagcggg cgctgcggcg 2640

gctgtagtgg gctctcttcc tccttccacc gtgaccccaa cctctcctgt cccctccctc 2700 

caactctgtc tctaagttat ttaaattatg gctggggtcg gggagggtac agggggcact 2760 

gggacctgga tttgtttttc taaataaagt tggaaaagca 2800 

<210> 86 

<211> 717 

<212> DNA 

<213> Homo sapien 

<400> 86 

ccccaggggc tcatcccagc ctggacttga cacgttgctc tctgttttta cctaaaagag 60 

ccagaagcgc tatcacgaag atatctttgg tgctgtattt ccttacgagg cgaagaagga 120 

cagggtggcg gtgcagccca gcagcctgga gatgtcagcg ctctgagggc ctacaggagg 180 

ggttaaagct gccggcacag aacttaagga tggagccagc tctgcattat ctgaggtcac 240 

agggcctggg gagatggagg aaagtgatat cccccagcct caagtcttat ttcctcaacg 300 

ttgctcccca tcaagccctt tacttgacct cctaacaagt agcaccctgg attgatcgga 360

gcctcctctc tcaaactggg gcctccctgg tcccttggag acaaaatctt aaatgccagg 420

cctggcaagt gggtgaaaga tggaacttgc tgctgagtgc accacttcaa gtgaccacca 480 

ggaggtgcta tcgcaccact gtgtatttaa ctgccttgtg tacagttatt tatgcctctg 540 

tatttaaaaa actaacaccc agtctgttcc ccatggccac ttgggtcttc cctgtatgat 600 

tccttgatgg agatatttac atgaattgca ttttacttta atcacactgt atgcgtgtgt 660 

gggtgttttg tagggaaagc tcttctcaga gtggggagct ggtgggtgtc acagcct 717 

<210> 87 

<211> 1098 

<212> DNA 

<213> Homo sapien 

<400> 87 

tgccagggag ctccaacaga actgagggac gaacagtagg tgcctgagag ctggccgatc 60 

tggtcactct ctccagtggg ccacaggggc ccacttggtg ggcagtctct gctgccagct 120 

agggcatctc agctcagccc agagcaaaga cgtgtaggga cataacgctt tcctcaagct 180 

tcctttccca ggtccaaact tagaggccca gttttgggcc tagaattcac tgaggtttgt 240

ggctgtttgt ccaacagtca ccatggatca ccgaagccga ctacggggca caggcctgaa 300 

ccgaatccct gggactcagt cccgagcccc ccgagtccca ctccccttcc acgtgcaaca 360 

ggaggccagg gaaggagaag actgggagcg agagccacct cgtcagaggc ctcctatcta 420

tgagccacca gaaagtgaag agctgccaga taatgttatg gtttccaagc cagcgcccta 480 
ctgggaagga acagctgtga tcgatggaga atttaaggag ctgaagttaa ctgattatcg 540

tgggaaatac ttggttttct tcttctaccc acttgatttc acatttgtgt gtccaactga 600 

aattatcgct tttggcgaca gacttgaaga attcagatct ataaatactg aagtggtagc 660 

atgctctgtt gattcacagt ttacccattt ggcctggatt aatacccctc gaagacaagg 720 

aggacttggg ccaataagga ttccacttct ttcagatttg acccatcaga tctcaaagga 780 

ctatggtgta tacctagagg actcaggcca cactcttaga ggtctcttca ttattgatga 840 

caaaggaatc ctaagacaaa ttactctgaa tgatcttcct gtgggtagat cagtggatga 900 

gacactacgt ttggttcaag cattccagta cactgacaaa cacggagaag tctgccctgc 960 

tggctggaaa cctggtagtg aaacaataat cccagatcca gctggaaagc tgaagtattt 1020 

cgataaactg aattgagaaa tacttcttca agttatgatg cttgaaagtt ctcaataaag 1080 

ttcacggttt cattacca 1098 

<210> 88 

<211> 731 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature
<222> (21)..(21)
<223> n=a, c, g or t 

<400> 88 

tatagataga tatgtttatc nccataactc ataactggga gtattatacc caagaggctt 60 

tgttacaagg tattttttac cctgttagta aatattttcc tgtaattcta taataagcaa 120 

ataaattttc atggttgggg tagatgcaaa aaaaaaagaa ttcgaattcg aattcggaac 180 

gtcatcgttt ggaaagcgtc gcaataagac gcacacgttg tgccgccgct gtggctctaa 240 

ggcctaccac cttcagaagt cgacctgtgg caaatgtggc taccctgcca agcgcaagag 300 

aaagtataac tggagtgcca aggctaaaag acgaaatacc accggaactg gtcgaatgag 360 

gcacctaaaa attgtatacc gcagattcag gcatggattc cgtgaaggaa caacacctaa 420 

acccaagagg gcagctgttg cagcatccag ttcatcttaa gaatgtcaac gattagtcat 480

gcaataaatg ttctggtttt aaaaaataca tatctggttt tggtaaggta tttttaatca 540 

attaggcttg tagtatcagt gaaatactgt aggtttaggg actgggctag cttcatatca 600 

gatttacttg ttaagtgact gttttggaat gtttactttt ggactgggtt tgtaacacgg 660 

ttaaaggcaa tgagaaacaa gcagaattcc aggagtcctt gaagcagagg gcactggaag 720 

acaatatagc a 731 
<210> 89

<211> 777 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature

<222> (50)..(50)

<223> n=a, c, g or t

<220>

<221> misc_feature

<222> (90)..(90)

<223> n=a, c, g or t

<220>

<221> misc_feature

<222> (199)..(199)

<223> n=a, c, g or t

<220>

<221> misc_feature

<222> (201)..(201)

<223> n=a, c, g or t

<220>

<221> misc_feature

<222> (204)..(204)

<223> n=a, c, g or t

<220>

<221> misc_feature

<222> (206)..(206)

<223> n=a, c, g or t 



<400> 89 

atgttctttg tttgacaatc tcataagact ttagtcttac agacgacccn tcgaagtctg 60

ctggcacaca ccacatgcag attttggtgn tttcccaacc ttcttggtat aaaggtaaac 120 

aattctatta ccaggggttc gggacagcct agttttgtta gaggctgtat tgtaggaaag 180 

cctacgtcgg tatgtcaana nctntncgaa ttcggccgag gggaacgtca tcgtttggaa 240 

agcgtcgcaa taagacgcac acgttgtgcc gccgctgtgg ctctaaggcc taccaccttc 300 

agaagtcgac ctgtggcaaa tgtggctacc ctgccaagcg caagagaaag tataactgga 360

gtgccaaggc taaaagacga aataccaccg gaactggtcg aatgaggcac ctaaaaattg 420 

tataccgcag attcaggcat ggattccgtg aaggaacaac acctaaaccc aagagggcag 480

ctgttgcagc atccagttca tcttaagaat gtcaacgatt agtcatgcaa taaatgttct 540
ggttttaaaa aatacatatc tggttttggt aaggtatttt taatcaatta ggcttgtagt 600

atcagtgaaa tactgtaggt ttagggactg ggctagcttc atatcagatt tacttgttaa 660

gtgactgttt tggaatgttt acttttggac tgggtttgta acacggttaa aggcaatgag 720

aaacaagcag aattccagga gtccttgaag cagagggcac tggaagacaa tatagca 777


<210> 90 

<211> 564 

<212> DNA 

<213> Homo sapien 












<400> 90

atgcatgccg agcggcgcag tgtgatggat cgtggtcgcg gccgaggtcg ttgtgccgcc 60

gctgtggctc taaggcctac caccttcaga agtcgacctg tggcaaatgt ggctaccctg 120

ccaagcgcaa gagaaagtat aactggagtg ccaaggctaa aagacgaaat accaccggaa 180

ctggtcgaat gaggcaccta aaaattgtat accgcagatt caggcatgga ttccgtgaag 240

gaacaacacc taaacccaag agggcagctg ttgcagcatc cagttcatct taagaatgtc 300

aacgattagt catgcaataa atgttctggt tttaaaaaat acatatctgg ttttggtaag 360

gtatttttaa tcaattaggc ttgtagtatc agtgaaatac tgtaggttta gggactgggc 420

tagcttcata tcagatttac ttgttaagtg actgttttgg aatgtttact tttggactgg 480

gtttgtaaca cggttaaagg caatgagaaa caagcagaat tccaggagtc cttgaagcag 540

agggcactgg aagacaatat 564


<210> 91 

<211> 540 

<212> DNA 

<213> Homo sapien 












<400> 91 

atgcatgccg agcggcgcag 


tgtgatggat 


cgtggtcgcg 


gccgaggtag 


gcctaccacc 


60 


ttcagaagtc 


gacctgtggc 


aaatgtggct 


accctgccaa 


gcgcaagaga 


aagtataact 


120 


ggagtgccaa ggctaaaaga 


cgaaatacca 


ccggaactgg 


tcgaatgagg 


cacctaaaaa 


180 


ttgtataccg 


cagattcagg 


catggattcc 


gtgaaggaac 


aacacctaaa 


cccaagaggg 


240 


cagctgttgc 


agcatccagt 


tcatcttaag 


aatgtcaacg 


attagtcatg 


caataaatgt 


300 


tctggtttta 


aaaaatacat 


atctggtttt 


ggtaaggtat 


ttttaatcaa 


ttaggcttgt 


360 


agtatcagtg 


aaatactgta 


ggtttaggga 


ctgggctagc 


ttcatatcag 


atttacttgt 


420 


taagtgactg 


ttttggaatg 


tttacttttg 


gactgggttt 


gtaacacggt 


taaaggcaat 


480 


gagaaacaag cagaattcca ggagtccttg aagcagaggg cactggaaga caatatagca 540 


<210> 92 

<211> 1103 

<212> DNA 

<213> Homo sapien 



<400> 92 


cyay cy y cy o 


aggtgatgga 


tgcgcccggg 


caggtggccg agcaggaggc 


60 




yay tggacat 


ccgccataac 


aaggaccgaa 


aggttcggcg caaggagccc 


120 


a. ay ay ^- a y y 




gaggctgttg gtcaagttat 


acaggtttct 


ggccagaaga 


180 






ggttgtgttg aagaggttgt 


ttatgagtcg 


caccaaccgg 


240 


ucgcctccgc 


ccctcccccg 


gatgatccgg 


aagatgaagc 


ttcctggccg ggaaaacaag 


300 


acygecytgg 


t cgeggggae 


cataactgat 


gatgtgcggg 


ttcaggaggt 


acccaaactg 


360 


aayytacyuy 


CaCtyCycgu 


gaccagccgg 


gcccgcagcc 


gcatcctcag ggcagggggc 


420 


a ciy aLLLLLd 


CCttCyaCCa 


gctggccctg gactccccta 


agggctgtgg 


cactgtcctg 


480 


LLLtucyytc 


cuegcaaggg 


ccgagaggtg 


taccggcatt 


tcggcaaggc 


cccaggaacc 


540 


^*^»y wdvciy 


a^auuaay cy 


agtatcaggc 


ccccagccct 


gccctctccc 


cagactcagc 


600 


ctgcagggcc 


aggcctggcc 


acacttgggc 


tgcttctcct 


atccctcgta 


ccagccagcc 


660 


ccaggccttc 


tggaactctc 


agctgcctca 


tcctgcctgc 


ttttcccttc 


aacactgctt 


720 


ggagctgtgt 


tttccagggc 


ctgagcaggc 


atctcctggg 


gagcatgcag 


ttcaccggcc 


780 


tttgccagcc 


caggctggga 


ccctccaggt 


ggtggggcag 


gtgcttccat 


agcccctcct 


840 


ggctccttgg 


cttcccactc 


tgccaagctt 


ttccagcagc 


attgactctc 


ctggggctga 


900 


atgtgacagg 


cttgtggtgt 


tcctgtgcca 


ctccacaatg 


gcctccttta 


agagggcccc 


960 


cctctcactc 


cctcctctcc 


ccacagaccc 


tacgtccgct 


ccaagggccg gaagttcgag 


1020 


cgtgccagag 


gccgacgggc 


cagccgaggc 


tacaaaaact 


aaccctggat 


cctactctct 


1080 


tattaaaaag 


atttttgctg 


aca 








1103 


<210> 93 

<211> 398 

<212> DNA 

<213> Homo sapien 












<400> 93 
gatgcatgct cgagcggcgc aggtgatgga tgcgcccggg caggtggccg agcaggaggc 60 

gccatcatgg gagtggacat ccgccataac aaggaccgaa aggttcggcg caaggagccc 120 

aagagccagg atatctacct gaggctgttg gtcaagttat acaggtttct ggccagaaga 180 

accaactcca cattcaacca ggttgtgttg aagaggttgt ttatgagtcg caccaaccgg 240 

ccgcctctgt ccctttcccg gatgatccgg aagatgaagc ttcctggccg ggaaaacaag 300 




acggccgtgg ttgtggggac cataactgat gatgtgcggg ttcaggaggt acccaaactg 360 

atcggccgcg accacgctaa gccggattcc agcacact 398 

<210> 94 

<211> 1673 

<212> DNA 

<213> Homo sapien 

<400> 94 

gccagtgtga tggattggtc gcggcgaggt ggccgagcag gaggcgccat catgggagtg 60 

gacatccgcc ataacaagga ccgaaaggtt cggcgcaagg agcccaagag ccaggatatc 120 

tacctgaggc tgttggtcaa gttatacagg tttctggcca gaagaaccaa ctccacattc 180 

aaccaggttg tgttgaagag gttgtttatg agtcgcacca accggccgcc tctgtccctt 240 

tcccggatgg tgagtggctg gtccagagag cacggtagac ctgggagccg ctgggtcttg 300 

tctgtctgga agggaggcag gacttggagc tctgggagca accagggcat caagggtttg 360 

tctcagccag tcgcgagcgt agagcttgga ctgttggttg gaactgagtg tccatgggca 420 

gtgggcaagt ctcctggtcc tcctctcctt ctctttgtga ggtggaggtg tcctggtggc 480 

ttcaggagat tgccccaagt cataactgag ttttatgtca aaggctctgc tgagggaggc 540 

cctattgaac agtctgcatt ttttttttta tctggagctt tccccagctg gacctgagga 600 

gaggcctgtg gggtttaagg ggtgtggtgt gagtgggggg tcccgttaga gggcacagct 660 

gccctggcct ggggatgcct caagcggggc tgaatgtaaa caccagaaca acttacgacg 720 

tacatcctcc ccaccctaag atccggaaga tgaagcttcc tggccgggaa aacaagacgg 780 

ccgtggttgt ggggaccata actgatgatg tgcgggttca ggaggtaccc aaactgaagg 840 

tgagctggcg ggggctgggc agacccatca gacccttgct gtactgtgct gtgctgtgct 900 

gtgctgtgct gtgctgtgct ggtccatcct cagtcttgcc accgctggtg aggcaagcat 960 

cctgcctgtc tgagtggctt tgaggattgg tgtgggtcgt ctctggcatg gttgatggtg 1020 

gggtcccctg ggcaagcttc ttctggggca agttgtaaac tgttagtgct ggcccctaga 1080 

gccagacagc ctgggctcaa gtctagactc ctttgtcacc tctgcacagg tctttttttg 1140 

gcccagcttc cttgctgtag tttctagcat aacaccaggg atgtggtggt gaaagttccg 1200 

gaagtgttga gaactgctgc tgtcagccca gggtgctccg ggtagtaaac gatagtgttg 1260 

gttaccaggg ctctgctgcg ccaaggcagt ggagggtgag tcctggcagc ccctgggtga 1320 

ccacgcaggg gtctgacccg ccagccttgt cctctccacc aggtatgtgc actgcgcgtg 1380 

accagccggg cccgcagccg catcctcagg gcagggggca agatcctcac tttcgaccag 1440 

ctggccctgg actcccctaa gggctgtggc actgtcctgc tctccggtcc tcgcaagggc 1500 

cgagaggtgt accggcattt cggcaaggcc ccaggaaccc cgcacagcca caccaaaccc 1560 

tacgtccgct ccaagggccg gaagttcgag cgtgccagag gccgacgggc cagccgaggc 1620 

tacaaaaact aaccctggat cctactctct tattaaaaag atttttgctg aca 1673 


<210> 95 

<211> 1121 

<212> DNA 

<213> Homo sapien 












<400> 95 

tgagtgcctg gaccctgacc 


tccaacggga 


gatccttccc ggggccattc 


cctaaatcgt 


60 


gggcttgttt 


cttgctccct 


ctggctattc 


tgtgtccctg 


tggctgctcc 


ccaaccctaa 


120 


gggctgtccc 


tgatctctct 


tgctatttcc 


ccaagcccga 


gactgctccc 


ttacaaattc 


180 


tggctgctcc 


ctttccctgt 


gtccaatacc 


gtgtgatcac 


ctgtccctcc 


ctggtccctc 


240 


taattctccc 


ttgtgactat 


tctgtaatcc 


cactccccgt 


tcctgagccc 


ccgggattgt 


300 


ttcttggttc 


cccagaatgt 


tctccaagaa 


cccaaagtgc 


tgttccctaa 


tcccactgac 


360 


ccttcctcag 


cacatctgct 


tgttgctctg 


cagctctgac 


tattccagac 


accagagggt 


420 


gttacactga 


cctcctggct 


agtccaagcc 


tgttccctga 


tggctcctga 


tggcctctga 


480 


gcccagtgac 


tatcttgaag 


ccccactggc 


tattcctgtg 


cttaaactcc 


ttagccgcca 


540 


accgggtggg gccccttact 


actgccagtt 


tttccctgtc 


ctttgggaca 


attccacagg 


600 


cgacctgcct 


gactgacccc 


cagggcctat 


tccccagcca 


ccattccagg acactgattt 


660 


actggcttac 


tgttaccttt 


caggacccag 


tgacatttcc 


ctcaatggga 


ggggctgtcc 


720 


tggggtatgt 


acttaggtgg 


ggtgagagca 


agtctcccca gcggctcagt 


gttgaaagct 


780 


gaggagagga 


agttctgctc 


cacgggggag 


gcagggagaa 


gccctcagag 


atgtttccct 


840 


cctcccttcc 


aggtgtttcc 


ctgtttatcc 


atcccctgca 


aactgcagag 


tggcactcat 


900 


tgcttgtgga 


cggaccagct 


cctccaaggc 


tctgaaaagg 


gcttccagtc 


ccgtcacctt 


960 


gcctgcctgc 


ctcgggagcc 


agggctgtgc 


acctggcagt 


ccctgcggtc 


ccagatagcc 


1020 


tgaatcctgc 


ccggagtgga 


agctgaagcc 


tgcacagtgt 


ccaccctgtt 


cccactccca 


1080 


tctttcttcc 


ggacaatgaa 


ataaagagtt 


accacccagc 


a 




1121 


<210> 96 

<211> 570 

<212> DNA 

<213> Homo sapien 












<400> 96 

ccagtccctg gggcgcggcc tagcaaccac gagggggcga ggctctgatg ggaatggtcc 60 

cactggaaat ggggaccacc ccaatttcag tctatcagaa ggccgggcct ttggcagctt 120 


ggcagctcag 


ccaatcacaa 


gctgcttgtc 


ggtccccgcc 


ccgcctttct 


ccttaggaaa 


180 


actgcaggat 


ggactcttgc 


acatcactac 


ctgcagtttt 


gtggctccct 


ggaacagcct 


240 


gagcttagct 


cagcgccggg 


gcttcaccaa 


gacctacact 


gttggctgtg 


aggaatgcac 


300 


agtgtttccc 


tgtttatcca 


tcccctgcaa 


actgcagagt 


ggcactcatt 


gcttgtggac 


360 


ggaccagctc 


ctccaaggct 


ctgaaaaggg 


cttccagtcc 


cgtcaccttg 


cctgcctgcc 


420 


tcgggagcca 


gggctgtgca 


cctggcagtc 


cctgcggtcc 


cagatagcct 


gaatcctgcc 


480 


cggagtggaa gctgaagcct 


gcacagtgtc 


caccctgttc 


ccactcccat 


ctttcttccg 


540 


gacaatgaaa 


taaagagtta 


ccacccagca 








570 


<210> 97 

<211> 4990 

<212> DNA 

<213> Homo sapien 












<400> 97 
gggagtcggc 


ggcacaaaat 


ggcggcggcg 


gcggcggcgg 


ctggtgctgc 


agggtcggca 


60 


gctcccgcgg 


cagcggccgg 


cgccccggga 


tctgggggcg 


caccctcagg 


gtcgcagggg 


120 


gtgctgatcg 


gggacaggct 


gtactccggg 


gtgctcatca 


ccttggagaa 


ctgcctcctg 


180 


cctgacgaca 


agctccgttt 


cacgccgtcc 


atgtcgagcg 


gcctcgacac 


cgacacagag 


240 


accgacctcc 


gcgtggtggg 


ctgcgagctc 


atccaggcgg 


ccggtatcct 


gctccgcctg 


300 


ccgcaggtgg 


ccatggctac 


cgggcaggtg 


ttgttccagc 


ggttctttta 


taccaagtcc 


360 


ttcgtgaagc 


actccatgga 


gcatgtgtca 


atggcctgtg 


tccacctggc 


ttccaagata 


420 


gaagaggccc 


caagacgcat 


acgggacgtc 


atcaatgtgt 


ttcaccgcct 


tcgacagctg 


480 


agagacaaaa 


agaagcccgt 


gcctctacta 


ctggatcaag 


attatgttaa 


tttaaagaac 


540 


caaattataa 


aggcggaaag 


acgagttctc 


aaagagttgg 


gtttctgcgt 


ccatgtgaag 


600 


catcctcata 


agataatcgt 


tatgtacctt 


caggtgttag 


agtgtgagcg 


taaccaacac 


660 


ctggtccaga 


cctcatgggt 


agcctctgag 


ggtaagtgac 


taagacttct 


cctctgctgt 


720 


ccaagcgctt 


tggtgcaggg 


acagcggcat 


cttcagccaa 


tccagtgcag 


gctctccacc 


780 


gaaggctggc 


tctagactgg 


tggtacgcac 


atagcatagc 


catggccgac 


tcctgctgtg 


840 


gttctctgac 


gattgtgctt 


cttgttaatc 


ctctgtcgtg 


ctttggtaat 


cgtattgatt 


900 


agagttggta 


actgtcttga 


cttgaatttt 


gtccctttaa 


aactgctgta 


cctgtatgat 


960 


aaagatgcag 


tacctttctc 


ttaaaaaaaa 


atgctatgga 


aagctgtgag 


aattgaagag 


1020 


acaaattggc tgtgtcagtg tggggttatg tcatgatttc tagaagccct gaagttgctc 1080 

ttttgagcag ctttgcatga cacgctctgg taaaaggtgt gcatctttaa attatttcat 1140 

ggatactttg aaaaatattg tatcacttca aatacagcaa taagtttata tgttctcaag 1200 

atttcatttg tttttaagaa ttttaagttc gtggattaat atcactactt gaatactgac 1260 

agttgttgat tagacaccga aaggttactg attgttgaat gtatctgtgt tagagctgtg 1320 

cactggcacg cttgcatcag gggctggggc cacacggccg ccacacagat tcccccgtga 1380 

tgcctggagc tgcttccaga gccgggtgtc tccaagaggc acctgtagga cttcccattt 1440 

agaaatctct tgagtgggtt tgtatgttac cttctccaag gtttatttag gacagagata 1500 

ttgctggaag gtcatgggtc agattccctc acaacccacc tcgtctgcgg gtgcagcccc 1560 

actccaaggc tccccgttat tggggtatgt gaggagcagt aaatataaaa ccagttcaac 1620 

tgtcctcatg gaatcaccct ttctgttttt gcagtattca taaagctagt gtaaggtctg 1680 

gttttagtct attaaatctt agagatctaa aggaaatgct caaaatgtag ccaggtttta 1740 

aatgctttaa cttttaaaaa atgtaaattt ttgtatgttt atagcttcta aatatgaaag 1800 

ttaaagaatg tactgtgatg aaatgttcag tattatgttg cttctcagta tcatgttgct 1860 

tctcagtatt gtgttgcttc tgattctatg aatgttcatt ttaagacccc ttgttgaaat 1920 

gggacagttg gcagcggctc tgatgagccc gagaagaggc ctgcccttgg gtgcggagtc 1980 

tccctccgca cgatgctccc acgcgtccaa cttgcaccca aggggctttt ccctcttcca 2040 

agtggactcc ttcaaggaag ctgcagctcg gtcagcagag aaggggcctg ccgccagcgc 2100 

cctggaggaa gaggaagagg aacccaagag gatggcttgt ctcccagcag ccacaccggc 2160 

tttgtgctca gccagttcat ttgagtttgc atgtttctct gcactatgga ttttgagcat 2220 

ttagatttct ttaatcaaaa gcgttttagt gactccagta gacattttct ttctgaggca 2280 

tcgtgctttg catgagagca ggccaaggtt gaggggaaaa gtaaagttaa agtcggttct 2340 

ctttcatagc aacacgtatt gtctgacatt cagccagctt tttttttttc taataatttc 2400 

tgtgcctttc tgtcctgtat ttactgtatt tagaaaaagc agctagaata tttctccatt 2460 

aactcttgag attcacagga ctgtctagct ctgagtccta gcaatagact ccttagagga 2520 

gtagtacgtt tatctagatt ttctctagat aatgcaggcg gaagacctgg gttcccgggt 2580 

ggggcattgc agttcttcct gtgtttggct tccaggaatt acatgaacga cagccttcgc 2640 

accgacgtct tcgtgcggtt ccagccagag agcatcgcct gtgcctgcat ttatcttgct 2700 

gcccggacgc tggagatccc tttgcccaat cgtccccatt ggtttctttt gtttggagca 2760 

actgaagaag aaattcagga aatctgctta aagatcttgc agctttatgc tcggaaaaag 2820 

gttgatctca cacacctgga gggtgaagtg gaaaaaagaa agcacgctat cgaagaggca 2880 

aaggcccaag cccggggcct gttgcctggg ggcacacagg tgctggatgg tacctcgggg 2940 

ttctctcctg cccccaagct ggtggaatcc cccaaagaag gtaaagggag caagccttcc 3000 


ccactgtctg 


tgaagaacac 


caagaggagg 


ctggagggcg 


ccaagaaagc 


caaggcggac 


3060 


agccccgtga 


acggcttgcc 


aaaggggcga 


gagagtcgga 


gtcggagccg gagccgtgag 


3120 


cagagctact 


cgaggtcccc 


atcccgatca 


gcgtctccta 


agaggaggaa 


aagtgacagc 


3180 


ggctccacat 


ctggtgggtc 


caagtcgcag 


cgctccctac 


aaaggctctg 


agattcgggg 


3240 


ctcccggaag 


tccaaggact 


gcaagtaccc 


ccagaagcca 


cacaagtctc 


ggagccggag 


3300 


ttcttcccgt 


tctcgaagca 


ggtcacggga 


gcgggcggat 


aatccgggaa aatacaagaa 


3360 


gaaaagtcat 


tactacagag 


atcagcgacg 


agagcgctcg 


aggtcgtatg 


aacgcacagg 


3420 


ccgtcgctat 


gagcgggacc 


accctgggca 


cagcaggcat 


cggaggtgag gcggggttgc 


3480 


agtgactggt 


ggccgcaagc 


ccttccctgg 


ggagtacctg 


atggctgccc 


tttgaccccc 


3540 


ggtggctgcc 


ctttgacccc 


cgggtgtgct 


ctcagcgcaa 


gtggtcctag 


aacaggattc 


3600 


tttttggaaa 


tgtctgtcga 


ctggaccttg 


gtggatttgg 


aaatggaact 


gagggaccgg 


3660 


tgacacgtgc 


ttcagaccgg 


tctggggtgc 


ggcgcacacc 


tgggcccgtg cagggctcag 


3720 


ctcggcagca 


gctctgaggg 


cagctcaatg 


aaaaagtgaa 


tgcacacgcc 


cttgttggcg 


3780 


tggcctggca 


tggcctggtg 


ctatcggcag 


ccgctctcca 


ctccccgact 


gatactcaat 


3840 


tacgtgaagc 


caagaaagat 


gatttttaga 


acctttgcct 


atattaggtt 


gtacttatgt 


3900 


acatattttg 


cagtgtttca 


caggagaaag 


tggccttaac 


tgccccttat 


tctctctcca 


3960 


cgttgtaaat 


aaacatgtgt 


ttaatacaag 


ttaaagctat 


gtatgaaaac 


tcagaacttg 


4020 


aatcccgtca 


gcttaaaact 


tgtgtaggga 


atcctgactt 


ttaaaatgtg 


agggtatttg 


4080 


gatctgtgtt 


gaaagtcgta 


tatttttatc 


tgtgcggtgc 


tgagtgcagg 


ccaccagctc 


4140 


ctaaatagag 


gttccctata 


tgcgcgtatg 


acatggtgaa 


taaacacaac 


tctctccact 


4200 


caggacatcc 


ggagcgttat 


ggacgtggta 


ggtggtcgtt 


ctgtgtgctt 


gtgaaagtgt 


4260 


ccaggcgtgt 


gcacagccag 


tgcgcccact 


tccgggctcc 


ttgctccctg 


ctgtactgaa 


4320 


gttttggatt 


ttgcatccaa 


tcctgtgtgc 


ctgcccttct 


gccgaaggct 


tgtgaggggc 


4380 


ctgagtcctc 


tgcccatcag 


gatgacaggc 


tccttcctgc 


agggccatag gagggaagtt 


4440 


ttggaaacac 


agaatgattc 


caaggtgctc 


tcgttcctga 


gggggactgg 


tttgtaaccc 


4500 


atgacatctg 


tgggcgagag 


aggcagctgg 


gagcaggaca 


cttggagggt 


caccccacgg 


4560 


gggtggcacc 


tgcactctga 


gtgcccccca 


ctgtcatcag 


ctgcctctta 


ccgtggacac 


4620 


agttttggtt 


ttggggacta 


gggggcccca 


ctcctggtgg 


taccgtttgg acttactagg 


4680 


gcagtgggac atataggccg gggctagtgg gataacgggg agttacgcct gatgactttt 4740 

ttgatggaat cctgcattag atagctggtg ggaccccccc ctcagaattg gggaactgag 4800 


gagactccag ggagggtgtc 


cttccaggga gagcagctat 


gaggggcccc 


ctagcttcct 


4860 


gtgcctggaa 


gtaagagaac 


cagtaaaggg 


ccatacacac 


ctgtacccaa 


gagaccgctc 


4920 


tccatttgct 


ttcttttttt 


actaaataat 


tgtaaaatat 


tattatgaca 


taaagaacca 


4980 


tttaaggcca 












4990 


<210> 98 
<211> 4944 
<212> DNA 
<213> Homo sapien 












<400> 98 
gggagtcggc 


ggcacaaaat 


ggcggcggcg 


gcggcggcgg 


ctggtgctgc 


agggtcggca 


60 


gctcccgcgg 


cagcggccgg 


cgccccggga 


tctgggggcg 


caccctcagg 


gtcgcagggg 


120 


gtgctgatcg 


gggacaggct 


gtactccggg gtgctcatca 


ccttggagaa 


ctgcctcctg 


180 


cctgacgaca 


agctccgttt 


cacgccgtcc 


atgtcgagcg 


gcctcgacac 


cgacacagag 


240 


accgacctcc 


gcgtggtggg 


ctgcgagctc 


atccaggcgg 


ccggtatcct 


gctccgcctg 


300 


ccgcaggtgg 


ccatggctac 


cgggcaggtg 


ttgttccagc 


ggttctttta 


taccaagtcc 


360 


ttcgtgaagc 


actccatgga 


gcatgtgtca 


atggcctgtg 


tccacctggc 


ttccaagata 


420 


gaagaggccc 


caagacgcat 


acgggacgtc 


atcaatgtgt 


ttcaccgcct 


tcgacagctg 


480 


agagacaaaa 


agaagcccgt 


gcctctacta 


ctggatcaag 


attatgttaa 


tttaaagaac 


540 


caaattataa 


aggcggaaag 


acgagttctc 


aaagagttgg 


gtttctgcgt 


ccatgtgaag 


600 


catcctcata 


agataatcgt 


tatgtacctt 


caggtgttag 


agtgtgagcg 


taaccaacac 


660 


ctggtccaga 


cctcatgggt 


agcctctgag ggtaagtgac 


taagacttct 


cctctgctgt 


720 


ccaagcgctt 


tggtgcaggg 


acagcggcat 


cttcagccaa 


tccagtgcag 


gctctccacc 


780 


gaaggctggc 


tctagactgg 


tggtacgcac 


atagcatagc 


catggccgac 


tcctgctgtg 


840 


gttctctgac 


gattgtgctt 


cttgttaatc 


ctctgtcgtg 


ctttggtaat 


cgtattgatt 


900 


agagttggta 


actgtcttga 


cttgaatttt 


gtccctttaa 


aactgctgta 


cctgtatgat 


960 


aaagatgcag 


tacctttctc 


ttaaaaaaaa 


atgctatgga 


aagctgtgag 


aattgaagag 


1020 


acaaattggc 


tgtgtcagtg 


tggggttatg 


tcatgatttc 


tagaagccct 


gaagttgctc 


1080 


ttttgagcag 


ctttgcatga 


cacgctctgg 


taaaaggtgt 


gcatctttaa 


attatttcat 


1140 


ggatactttg 


aaaaatattg 


tatcacttca 


aatacagcaa 


taagtttata 


tgttctcaag 


1200 


atttcatttg 


tttttaagaa 


ttttaagttc 


gtggattaat 


atcactactt 


gaatactgac 


1260 


agttgttgat tagacaccga aaggttactg attgttgaat gtatctgtgt tagagctgtg 1320 

cactggcacg cttgcatcag gggctggggc cacacggccg ccacacagat tcccccgtga 1380 

tgcctggagc tgcttccaga gccgggtgtc tccaagaggc acctgtagga cttcccattt 1440 

agaaatctct tgagtgggtt tgtatgttac cttctccaag gtttatttag gacagagata 1500 

ttgctggaag gtcatgggtc agattccctc acaacccacc tcgtctgcgg gtgcagcccc 1560 

actccaaggc tccccgttat tggggtatgt gaggagcagt aaatataaaa ccagttcaac 1620 

tgtcctcatg gaatcaccct ttctgttttt gcagtattca taaagctagt gtaaggtctg 1680 

gttttagtct attaaatctt agagatctaa aggaaatgct caaaatgtag ccaggtttta 1740 

aatgctttaa cttttaaaaa atgtaaattt ttgtatgttt atagcttcta aatatgaaag 1800 

ttaaagaatg tactgtgatg aaatgttcag tattatgttg cttctcagta tcatgttgct 1860 

tctcagtatt gtgttgcttc tgattctatg aatgttcatt ttaagacccc ttgttgaaat 1920 

gggacagttg gcagcggctc tgatgagccc gagaagaggc ctgcccttgg gtgcggagtc 1980 

tccctccgca cgatgctccc acgcgtccaa cttgcaccca aggggctttt ccctcttcca 2040 

agtggactcc ttcaaggaag ctgcagctcg gtcagcagag aaggggcctg ccgccagcgc 2100 

cctggaggaa gaggaagagg aacccaagag gatggcttgt ctcccagcag ccacaccggc 2160 

tttgtgctca gccagttcat ttgagtttgc atgtttctct gcactatgga ttttgagcat 2220 

ttagatttct ttaatcaaaa gcgttttagt gactccagta gacattttct ttctgaggca 2280 

tcgtgctttg catgagagca ggccaaggtt gaggggaaaa gtaaagttaa agtcggttct 2340 

ctttcatagc aacacgtatt gtctgacatt cagccagctt tttttttttc taataatttc 2400 

tgtgcctttc tgtcctgtat ttactgtatt tagaaaaagc agctagaata tttctccatt 2460 

aactcttgag attcacagga ctgtctagct ctgagtccta gcaatagact ccttagagga 2520 

gtagtacgtt tatctagatt ttctctagat aatgcaggcg gaagacctgg gttcccgggt 2580 

ggggcattgc agttcttcct gtgtttggct tccaggaatt acatgaacga cagccttcgc 2640 

accgacgtct tcgtgcggtt ccagccagag agcatcgcct gtgcctgcat ttatcttgct 2700 

gcccggacgc tggagatccc tttgcccaat cgtccccatt ggtttctttt gtttggagca 2760 

actgaagaag aaattcagga aatctgctta aagatcttgc agctttatgc tcggaaaaag 2820 

gttgatctca cacacctgga gggtgaagtg gaaaaaagaa agcacgctat cgaagaggca 2880 

aaggcccaag cccggggcct gttgcctggg ggcacacagg tgctggatgg tacctcgggg 2940 

ttctctcctg cccccaagct ggtggaatcc cccaaagaag gtaaagggag caagccttcc 3000 

ccactgtctg tgaagaacac caagaggagg ctggagggcg ccaagaaagc caaggcggac 3060 

agccccgtga acggcttgcc aaaggggcga gagagtcgga gtcggagccg gagccgtgag 3120 

cagagctact cgaggtcccc atcccgatca gcgtctccta agaggaggaa aagtgacagc 3180 

ggctccacat ctggtgggtc caagtcgcag agccgctccc ggagcaggag tgactcccca 3240 


ccgagacagg 


ccccccgcag 


cgctccctac 


aaaggctctg 


agattcgggg 


ctcccggaag 


3300 


tccaaggact 


gcaagtaccc 


ccagaagcca 


cacaagtctc 


ggagccggag 


ttcttcccgt 


3360 


tctcgaagca 


ggtcacggga 


gcgggcggat 


aatccgggaa 


aatacaagaa 


gaaaagtcat 


3420 


tactacagag 


atcagcgacg 


agagcgctcg 


aggtcgtatg 


aacgcacagg 


ccgtcgctat 


3480 


gagcgggacc 


acccggtggc 


tgccctttga 


cccccgggtg 


tgctctcagc 


gcaagtggtc 


3540 


ctagaacagg 


attctttttg 


gaaatgtctg 


tcgactggac 


cttggtggat 


ttggaaatgg 


3600 


aactgaggga 


ccggtgacac 


gtgcttcaga 


ccggtctggg 


gtgcggcgca 


cacctgggcc 


3660 


cgtgcagggc 


tcagctcggc 


agcagctctg 


agggcagctc 


aatgaaaaag 


tgaatgcaca 


3720 


cgcccttgtt 


ggcgtggcct 


ggcatggcct 


ggtgctatcg 


gcagccgctc 


tccactcccc 


3780 


gactgatact 


caattacgtg 


aagccaagaa 


agatgatttt 


tagaaccttt 


gcctatatta 


3840 


ggttgtactt 


atgtacatat 


tttgcagtgt 


ttcacaggag 


aaagtggcct 


taactgcccc 


3900 


ttattctctc 


tccacgttgt 


aaataaacat 


gtgtttaata 


caagttaaag 


ctatgtatga 


3960 


aaactcagaa 


cttgaatccc 


gtcagcttaa 


aacttgtgta 


gggaatcctg 


acttttaaaa 


4020 


tgtgagggta 


tttggatctg 


tgttgaaagt 


cgtatatttt 


tatctgtgcg 


gtgctgagtg 


4080 


caggccacca 


gctcctaaat 


agaggttccc 


tatatgcgcg 


tatgacatgg 


tgaataaaca 


4140 


caactctctc 


cactcaggac 


atccggagcg 


ttatggacgt 


ggtaggtggt 


cgttctgtgt 


4200 


gcttgtgaaa 


gtgtccaggc 


gtgtgcacag 


ccagtgcgcc 


cacttccggg 


ctccttgctc 


4260 


cctgctgtac 


tgaagttttg 


gattttgcat 


ccaatcctgt 


gtgcctgccc 


ttctgccgaa 


4320 


ggcttgtgag 


gggcctgagt 


cctctgccca 


tcaggatgac 


aggctccttc 


ctgcagggcc 


4380 


ataggaggga 


agttttggaa 


acacagaatg 


attccaaggt 


gctctcgttc 


ctgaggggga 


4440 


ctggtttgta 


acccatgaca 


tctgtgggcg agagaggcag 


ctgggagcag 


gacacttgga 


4500 


gggtcacccc 


acgggggtgg 


cacctgcact 


ctgagtgccc 


cccactgtca 


tcagctgcct 


4560 


cttaccgtgg 


acacagtttt 


ggttttgggg 


actagggggc 


cccactcctg 


gtggtaccgt 


4620 


ttggacttac 


tagggcagtg 


ggacatatag gccggggcta 


gtgggataac 


ggggagttac 


4680 


gcctgatgac 


ttttttgatg 


gaatcctgca 


ttagatagct 


ggtgggaccc 


ccccctcaga 


4740 


attggggaac 


tgaggagact 


ccagggaggg 


tgtccttcca 


gggagagcag 


ctatgagggg 


4800 


ccccctagct 


tcctgtgcct 


ggaagtaaga 


gaaccagtaa 


agggccatac 


acacctgtac 


4860 


ccaagagacc 


gctctccatt 


tgctttcttt 


ttttactaaa 


taattgtaaa 


atattattat 


4920 


gacataaaga accatttaag gcca 4944 


<210> 99 

<211> 438 

<212> DNA 

<213> Homo sapien 

<400> 99 

gcctactact actaaattcg cggccgcgtc gaccttcttg aactggtgct gtctgggctt 60 

catagcattc gcctactccg tgaagtctag ggacaggaag atggttggcg acgtgaccgg 120 

ggcccaggcc tatgcctcca ccgccaagtg cctgaacatc tgggccctga ttttgggcat 180 

cttcatgacc ggattcatcc tgttactggt attcggctct gtgacagtct accatattat 240 

gttacagata atacaggaaa aacggggtta ctagtagccg cccatagcct gcaacctttg 300 

cactccactg tgcaatgctg gccctgcacg ctggggctgt tgcccctgcc cccttggtcc 360 

tgcccctaga tacagcagtt tatacccaca cacctgtcta cagtgtcatt caataaagtg 420 

cacgtgcttg tgaaaaar 438 

<210> 100 

<211> 450 

<212> DNA 

<213> Homo sapien 

<400> 100 

agattaatat atagggcaat tggtcatcta atcatgctcg agcggcgcag tgtgatggat 60 

yggycgccgg scgaggtacg tgaaaccaaa attaagggta aaagtgggag attcttcaca 120 

gtcaagctcc cagttgctct tgatcctggg gccaagattt cagtcattgt ggaaacagtc 180 

tacacccatg tgcttcatcc atatccaacc cagatcaccc agtcagagaa acagtttgtg 240 

gtgtttgagg ggaaccatta tttctactct ccctatccaa cgaagacaca aaccatgcgt 300 

gtgaagcttg cctctcgaaa tgtggagagc tacaccaagc tggggaaccc cacgcgctct 360 

gaggacctac tggattatgg gcctttcaga gatgtgcctg cctatagtca ggatactttt 420 

aaagtacctc ggccgcgacc acgctaagcc 450 

<210> 101 

<211> 1965 

<212> DNA 

<213> Homo sapien 

<400> 101 

agcggaatga acggctgctc agtggcggga gggcgggcta ccgcgcccgg gccctacccg 60 

ccctggcgca ctgctcttcc cggtcatgga ggcgccagcc gccggcttgt ttctgctcct 120 

gttgcttggg acttgggccc cggcgccggg cagcgcctcc tccgaggcac cgccgctgat 180 

caatgaggac gtgaagcgca cagtggacct aagcagccac ctggctaagg tgacggccga 240 

ggtggtcctg gcgcacctgg gcggcggctc cacgtcccga gctacctctt tcctgctggc 300 

tttggagcct gagctcgagg cccggctggc gcacctgggc gtgcaggtaa agggagaaga 360 

tgaggaagag aacaatttgg aagtacgtga aaccaaaatt aagggtaaaa gtgggagatt 420 

cttcacagtc aagctcccag ttgctcttga tcctggggcc aagatttcag tcattgtgga 480 

aacagtctac acccatgtgc ttcatccata tccaacccag atcacccagt cagagaaaca 540 

gtttgtggtg tttgagggga accattattt ctactctccc tatccaacga agacacaaac 600 

catgcgtgtg aagcttgcct ctcgaaatgt ggagagctac accaagctgg ggaaccccac 660 

gcgctctgag gacctactgg attatgggcc tttcagagat gtgcctgcct atagtcagga 720 

tacttttaaa gtacattatg agaacaacag ccctttcctg accatcacca gcatgacccg 780 

agtcattgaa gtctctcact ggggtaatat tgctgtggaa gaaaatgtgg acttaaagca 840 

cacaggagct gtgcttaagg ggcctttctc acgctatgat taccagagac agccagatag 900 

tggaatatcc tccatccgtt cttttaagac catccttcct gctgctgccc aggatgttta 960 

ttaccgggat gagattggca atgtttctac cagccacctc cttattttgg atgactctgt 1020 

agagatggaa atccggcctc gcttccctct ctttggcggg tggaagaccc attacatcgt 1080 

tggctacaac ctcccaagct atgagtacct ctataatttg ggtgaccagt atgcactgaa 1140 

gatgaggttt gtggaccatg tgtttgatga acaagtgata gattctctga ctgtgaagat 1200 

catcctgcct gaaggagcca agaacattga aattgatagt ccctatgaaa tcagccgtgc 1260 

cccagatgag ctgcactaca cctatctgga cacatttggc cgccctgtga ttgttgccta 1320 

caagaaaaat ctggtagaac agcacattca ggacattgtg ctggatgcac aggtcaagga 1380 

gctggtgctg aagtcggcgg tggaggctga gcgcctggtg gctggcaagc tcaagaaaga 1440 

cacgtacatt gagaatgaga agctcatctc aggaaagcgc caggagctgg tcaccaagat 1500 

cgaccacatc ctggatgccc tgtagcccct gcccgcatcc tccagggggc ccagggtgcc 1560 

tgcactttgc tgtggcaggc agattgggtg gtagtgggag gttgtgcatg gaggccagtg 1620 

aaagctgaca tctgtaaaag gccttcaagg aagagaaacc aggccctgcg tcaggcagtg 1680 

tgagtttgcc gtttgtcctt aactttcttt tttttttttt ttaaaaaaag aaaactttaa 1740 

aaaaactccc attaaaaaca aaacatcttt gtgttttgaa caaaggaatt ttcaatattt 1800 

gattggtatt ctgttctgaa gtctaggata tttttcagcc tataaagccc cctgttttat 1860 

gcccttctaa ttctgatgtt tgggtattgt gtgagtgcat gtgttttttt tttttttttt 1920 

taaagcgtgt gtgaacaaat ggaaataaag cagggactgt gaaca 1965 



<210> 102 

<211> 1193 

<212> DNA 

<213> Homo sapien 



<400> 102 

agcggaatga acggctgctc agtggcggga gggcgggcta ccgcgcccgg gccctacccg 60 

ccctggcgca ctgctcttcc cggtcatgga ggcgccagcc gccggcttgt ttctgctcct 120 

gttgcttggg acttgggccc cggcgccggg cagcgcctcc tccgaggcac cgccgctgat 180 

caatgaggac gtgaagcgca cagtggacct aagcagccac ctggctaagg tgacggccga 240 

ggtggtcctg gcgcacctgg gcggcggctc cacgtcccga gctacctctt tcctgctggc 300 

tttggagcct gagctcgagg cccggctggc gcacctgggc gtgcaggtaa agggagaaga 360 

tgaggaagag aacaatttgg aagtacgtga aaccaaaatt aagggtaaaa gtgggagatt 420 

cttcacagtc aagctcccag ttgctcttga tcctggggcc aagatttcag tcattgtgga 480 

aacagtctac acccatgtgc ttcatccata tccaacccag atcacccagt cagagaaaca 540 

gtttgtggtg tttgagggga accattattt ctactctccc tatccaacga agacacaaac 600 

catgcgtgtg aagcttgcct ctcgaaatgt ggagagctac accaagctgg ggaaccccac 660 

gcgctctgag gacctactgg attatgggcc tttcagagat gtgcctgcct atagtcagga 720 

tacttttaaa gtacattatg agaacaacag ccctttcctg accatcacca gcatgacccg 780 

agtcattgaa gtctctcact ggggtaatat tgctgtggaa gaaaatgtgg acttaaagca 840 

cacaggagct gtgcttaagg ggcctttctc acgctatgat taccagagac agccagatag 900 

tggaatatcc tccatccgtt cttttaagac catccttcct gctgctgccc aggatgttta 960 

ttaccgggat gagattggca atgtttctac cagccacctc cttattttgg atgactctgt 1020 

agagatggaa atccggcctc gcttccctct ctttggcggg tggaagaccc attacatcgt 1080 

tggctacaac ctcccaagct atgagtacct ctataatttg ggtcagtctt caatagtaag 1140 

ggagaaacta accttttctt taatttcgtg aatttttttt ctttcaatta tca 1193 

<210> 103 

<211> 1685 

<212> DNA 

<213> Homo sapien 

<400> 103 

agcggaatga acggctgctc agtggcggga gggcgggcta ccgcgcccgg gccctacccg 60 

ccctggcgca ctgctcttcc cggtcatgga ggcgccagcc gccggcttgt ttctgctcct 120 

gttgcttggg acttgggccc cggcgccggg cagcgcctcc tccgaggcac cgccgctgat 180 

caatgaggac gtgaagcgca cagtggacct aagcagccac ctggctaagg tgacggccga 240 

ggtggtcctg gcgcacctgg gcggcggctc cacgtcccga gctacctctt tcctgctggc 300 

tttggagcct gagctcgagg cccggctggc gcacctgggc gtgcaggtaa agggagaaga 360 

tgaggaagag aacaatttgg aagtacgtga aaccaaaatt aagggtaaaa gtgggagatt 420 

cttcacagtc aagctcccag ttgctcttga tcctggggcc aagatttcag tcattgtgga 480 

aacagtctac acccatgtgc ttcatccata tccaacccag atcacccagt cagagaaaca 540 

gtttgtggtg tttgagggga accattattt ctactctccc tatccaacga agacacaaac 600 

catgcgtgtg aagcttgcct ctcgaaatgt ggagagctac accaagctgg ggaaccccac 660 

gcgctctgag gacctactgg attatgggcc tttcagagat gtgcctgcct atagtcagga 720 

tacttttaaa gtacattatg agaacaacag ccctttcctg accatcacca gcatgacccg 780 

agtcattgaa gtctctcact ggggtaatat tgctgtggaa gaaaatgtgg acttaaagca 840 

cacaggagct gtgcttaagg ggcctttctc acgctatgat taccagagac agccagatag 900 

tggaatatcc tccatccgtt cttttaagac catccttcct gctgctgccc aggatgttta 960 

ttaccgggat gagattggca atgtttctac cagccacctc cttattttgg atgactctgt 1020 

agagatggaa atccggcctc gcttccctct ctttggcggg tggaagaccc attacatcgt 1080 

tggctacaac ctcccaagct atgagtacct ctataatttg ggtgaccagt atgcactgaa 1140 

gatgaggttt gtggaccatg tgtttgatga acaagtgata gattctctga ctgtgaagat 1200 

catcctgcct gaaggagcca agaccacatc ctggatgccc tgtagcccct gcccgcatcc 1260 

tccagggggc ccagggtgcc tgcactttgc tgtggcaggc agattgggtg gtagtgggag 1320 

gttgtgcatg gaggccagtg aaagctgaca tctgtaaaag gccttcaagg aagagaaacc 1380 

aggccctgcg tcaggcagtg tgagtttgcc gtttgtcctt aactttcttt tttttttttt 1440 

ttaaaaaaag aaaactttaa aaaaactccc attaaaaaca aaacatcttt gtgttttgaa 1500 

caaaggaatt ttcaatattt gattggtatt ctgttctgaa gtctaggata tttttcagcc 1560 

tataaagccc cctgttttat gcccttctaa ttctgatgtt tgggtattgt gtgagtgcat 1620 

gtgttttttt tttttttttt taaagcgtgt gtgaacaaat ggaaataaag cagggactgt 1680 

gaaca 1685 



<210> 104 

<211> 1182 

<212> DNA 

<213> Homo sapien 

<400> 104 

gggttgcgcc agggagccgg aaagccgact cccgaagttg gggtcctggg aaaacttggg 60 

tcctgggttg actgagaagc ggcggggaaa ggaggcgggc caggaggagg gggcctggcg 120 

gacgccggcc ggggggcggg gcgcggcggg gctgtcggtc acgcccctca gtccgccccg 180 
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ccccgccccg cctgccgggg aagggccacg ttgcccgccc ggccgtccgg ccccggcgcg 24 0 

ccgcagaaag ggctggcgag tcgaaaggcg aggcggccgc ggcagcgctt gggacgcgcc 300 

tgggcaccgg gctcgctccc tgcgccccgg agcaggccaa gttcggggcc aggacgtcgg 360 

gaggacctgg tgcatggctg cctcctaatc ccatagtcca gaggaggcat ccctaggact 420 

gcgggcaagg gagccgggca agcccagggc agccttgaac cgtcccctgg cctgccctcc 480 

ccggtggggg ccaggatgct gaagaagcag tctgcagggc ttgtgctgtg gggcgctatc 540 

ctctttgtgg cctgatacta accaaagggg cttccttgtc agggcctggt ggagttggtg 600 

ggtcatcggg gctcactgcc tcctgccctt ctctcctgtc tgacccccac ttagcccttc 660 

tctccttgca gcctagcagt ttatagttct gagatggaaa gttgaagggg gcaagcaaga 720 

cctctcctca gcccatgccc agctgtcagg agagaggtgc agggaggaag gccttgtgct 780 

gggacaacct ctctcttgcc ttacctcaga gagggactat gccctgaccc ctcctttctg 840 

aaaatcagtg ccctccctgt tgctctagga ggctcctgct ggcttggtag aagacagaat 900 

tcgatctgcc tgtccctttt tcccctgggg tttgacacac aggctcctct cagcatgagg 960 

tggagcagtg accaggtgga gcagtgacca ggacgcctct ggcccagtgc tgcccagcct 1020 

ccccgcccgc tcccaggcgc cccatgtcct cacaggccag gacgccatgg caggatggag 1080 

aggacttggt ggatttttgt ttcttgcctg acctcagttt catgaaagaa agtggaagct 1140 

acagaattat tttctaaaat aaaggctgaa ttgtctgaaa aa 1182

<210> 105 

<211> 2645 

<212> DNA 

<213> Homo sapien 



<400> 105

agcgggcgcg ctccgctcgg cagcctgtgg gacgcgaccg cggcgctagt ctcgttcctt 60

tgtgctgcgg cggcggcttc tcgagtcctc cccgacgcgt cctctaggcc agcgagcccc 120 

gcgctctccg gtgacggacc atgtcggcgg cgggagcggg cgcgggcgta gaggcgggct 180 

tctccagcga ggagctgctc tcgctccgtt tcccgctgca ccgcgcctgc cgcgacgggg 240

acctggccac gctctgctcg ctgctgcagc agacacccca cgcccacctg gcctctgagg 300

actccttcta tggctggacg cccgtgcact gggccgcgca tttcggcaag ttggagtgct 360 

tagtgcagtt ggtgagagcg ggagccacac tcaacgtctc caccacacgg tacgcgcaga 420 

cgccagccca cattgcagcc tttgggggac atcctcagtg cctggtctgg ctgattcaag 480 

caggagccaa cattaacaaa ccggattgtg agggtgaaac tcccattcac aaggcagctc 540 

gctctgggag cctagaatgc atcagtgccc ttgtggcgaa tggggctcac gtcgacctga 600 
gaaatgccag tggcctgaca gcagcagaca ttgcacaaac ccagggtttc caagagtgtg 660

cccagtttct cttgaacctc cagaattgtc atctgaacca tttctataac aatggcatct 720

taaatggggg tcatcagaat gtatttccta atcatattag tgtgggaaca aatcgaaaga 780

gatgcttgga agactcagaa gactttggag taaagaaagc tagaactgaa gctcaaagct 840

tggattctgc cgtgccactc acgaatggcg acacagaaga cgatgctgac aaaatgcacg 900

ttgataggga gtttgctgtt gtaacaggtg ggagtggaca gtttcctgtt agctgcaaca 960

acaatccaat ggttgaagac accaaacagc aggagagtgg ttctgttgga ccaaaagaaa 1020

tagaaatata tactgtgtca gcaatgcaga ccccctgtcg ttgcaggaat caatatgaaa 1080

aacagtagct ccgtatcgaa tacattgaca aatggatgtg tcatcaatgg acatttggac 1140

ttcccctcca cgaccccgct cagtgggatg gaaagcagga atggccagtg cttgacagga 1200

actaacggaa ttagcagtgg attagcccca ggacagccgt ttccgagtag ccagggttct 1260

cfcctgcatta gtgggactga ggagccagag aagaccctga gagctaaccc tgagttgtgc 1320

ggttctctgc acctgaacgg gagtccaagt agctgcatag ccagtaggcc ttcctgggtg 1380

gaagacattg gggataacct gtactatgga cactaccacg ggtttgggga cactgctgaa 1440

agcatcccag aactgaacag tgtggtcgag cattccaagt ccgtgaaggt gcaggagcgg 1500

tacgacagtg ccgtgctggg caccatgcac ctgcaccacg gctcctagag acgctgacct 1560

ggctctcgga aacgcaggag tccttcctgg tagccagctc agaataccca tgtagcagca 1620

acttgaacga atgtcacaac ttgtactttt tttatatact tcaactttct gaaaaagtaa 1680

acttcgacaa gttcccagca actgcttgtt tgtgcatgag tagggcttac taagtgcata 1740

gatgtttcta cagtgaggtg tcctttttat aaggtgcact tttggagttt ttctgatgcc 1800

aatctcaaca ttgtcttttt aatactgtca ccagatattg ccatttttct ttttgttaaa 1860

agattatatg atcaagataa attggggtgg taaatcaggt gcctggtaat ttatctcttt 1920

gcacatgggc atcattttaa aaagcttgct tccactcttt tctgtagaat ttgacggaac 1980

acagctattt ccctatgcaa ggtacagcct tacaaagatt tctgcagtga tttgtgtgaa 2040

gaagagaatg tttgtctttt tcaatgaagc tttgcagatc accatgtggt tgaaggtttt 2100

agttgtggac acagtggtcc ctccttaatg atgaagatca ctgccttggg cttcatggaa 2160

aacaggccca gcctggggct gcgtttggat ttattgtttt tattccacac ttcctacttg 2220

gtctctggaa gttttaccac atgtaacaga ttcctttata tgtagtgaaa atcactattt 2280

gtagaaactg tcaggtcaaa atatttaact gactgttgac atgtattttc ttttttcctt 2340

gtttttgttt tttggggttt tctgctttaa gatatatacc actatgtata tccagttaac 2400

tgagagaatt ttgactctct taataaaact gcattaagtt tttgattttg tagaaattag 2460
cttttgtcta ggcaactagt ggttatactc tgcaaatatt gtaatgaatt tttacttttt 2520

tgatttttgt aataaaaatt ggtgcagata aaatgtcaaa tgaacaaacc agtgttctaa 2580 

gagtgttact aacattttgt tctaaaactg tccttcacaa attgaataaa aaactctcac 2640 

actca 2645 

<210> 106 

<211> 2439 

<212> DNA 

<213> Homo sapien 

<400> 106 

gcgcgcgcat gttagggtaa tccatttttc tgtttgtttc tgaattgctt tggagaaggc 60 

aactcagact ccaaggaata caacagctgc cgtgtgaagc ttttctcact tttccagtgg 120 

tcgtagcagg aacagcgtct tgacctcaca tcagtgacgt gaaaagagct gagtttgaag 180 

tgcatattgg tcctccataa agaatgtgtg tatgtgctgt ttttttaatt aagtctggta 240

atttatttga tacatgtatt gtagttttag ttgaattaaa ctagttagta aaactgttgg 300

tgaggtagtg tttcagttac tctgtagata agctcttttg ttaaaacctc ttaagttttt 360 

tgttcctgaa tatgttacaa acatctattt tctcttactg tagctatcac atgaggaaaa 420 

aaacttgaaa aaaagacttc atagtgtgtg tttatgtttg attttagaag acccttttga 480 

ggacttcttt gggaatcgaa ggggtccccg aggaagcaga agccgaggga cggggtcgtt 540

tttctctgcg ttcagtggat ttccgtcttt tggaagtgga ttttcttctt ttgatacagg 600 

atttacttca tttgggtcac taggtcacgg gggcctcact tcattctctt ccacgtcatt 660 

tggtggtagt ggcatgggca acttcaaatc gatatcaact tcaactaaaa tggttaatgg 720 

cagaaaaatc actacaaaga gaattgtcga gaacggtcaa gaaagagtag aagttgaaga 780 

agatggccag ttaaagtcct taacaataaa tggtgtggcc gacgacgatg ccctcgctga 840 

ggagcgcatg cggagaggcc agaacgccct gccagcccag cctgccggcc tccgcccgcc 900 

gaagccgccc cggcctgcct cgctgctgag acacgcgcct cactgtctct ctgaggagga 960 

gggcgagcag gaccgacctc gggcacccgg gccctgggac cccctcgcgt ccgcagcagg 1020 

attgaaagaa ggtggcaaga ggaagaagca gaagcagaga gaggagtcga agaagaagaa 1080 

gtcgaccaaa ggcaatcact agaccggact tgaggcacgc ggtgcacccc cagacgctgg 1140 

cgctccaccg tgctcggcat gcggtcgtgc acacgcgcta ggtagcagcg tcggtcagga 1200

ctgtctcgag gccacactcg ctcggcagga ttatgcgatc acggatcagt cagagcaggg 1260 

tcaggagacg gggctgacgg cacgggtggc ggggacagac gtttgggact tggccgcgac 1320 

tctctgcttc tctccagctc tcaatctgct gcattttcct ctagtgcttc cggatcctct 1380
tcattctttt cggctactca accactccgc atgctgctgg aatatttctg gctttagaag 1440

tacaggaggg cgcagatggc taactgagta acattcatga aatgaggctt tctgtggcgg 1500

cgtagtgttt ggaattagaa ggtaattcag tagagtgtaa cttagagaat attgcaagtg 1560

acacattgaa tcctgcccgt cagggcacct tttcctcaga gcaatccggc cacacgaata 1620

gaaggctgtc gtgaatcaca tcagatgtaa aatcattcct tctgtttact cttttaattt 1680

tcatcctttg caggtagtgc aaattcaact tcaaatatgg tgtaggtttt gctagattcc 1740

atattttttt cttggatttt tgctaattat ttttagcaaa aaatttttgc tcagtggcac 1800

cctccctagt gtccatgggt tagggccatg ctggggaaaa cgggccggta tttacacacg 1860

cgcaaaacac ccagagacgg cacaaggagg ttgaactcat gtttcagttc gcgaacattg 1920

actccttacg aaagtcactt cattctaact agatgcgccc acttccggtc attatttcgt 1980

ttgcatgatg tattgcttct tcacgttttg tttttattga gcacggagta gaattccagg 2040

gctgccttga cttcttccct gcatgctccc tcccagtgac tttccttccc tttcacatga 2100

ggatctgccg ttcatgttgc tttctccttt gtcctcttgg acttgagggc attgtgaaaa 2160

gctttgctgt gatttaaaaa tgccagcaat tttaatctag cagtgttgaa gctgggaatt 2220

ttttggcgca atccatgtag cagtgaccca ggcttgggag ccagaaacaa gtgtgacctg 2280

ggattttatt taacacaact gttgccaaag agttggcttt gtttatttgg ttttggcggg 2340

gagaggagtg gtatttgatg ctttctgtgg acaatgtaac cctaaacaca tcatgtattt 2400

taaatgccac ctacataaat aaaacataag catattgaa 2439



<210> 107 

<211> 1958 

<212> DNA 

<213> Homo sapien 

<400> 107 

ggcagggcct caggccccca agagggctgt gagcaggcac cttcctgtga ggcacgatgc 60

caccatgtga agttgtgtgg gtctttggag agttctggca ggttgagggc ctgtgcgtgt 120

gggacccagg cctgccctgc ctgctgccgg ccatgttccc tctgcccagt catgttgaca 180

attgtggcca aattcaacat cgggcctgtt tcctgcaccc acgttggcac acactgagtt 240

caagacagtt tctgcagaaa tgaatccaga ggtgtttatt tctggcacgt tggatcagac 300

gttcagaggg tgggcgtctt tctccatgca cttcagctgg atcagtcagt gtcagccatt 360

gactttagca acagggaaat gaaagacttt cctcaataag aaatgggact ttctcacgct 420

ggatggcaca gagctggcaa acatgaggcc agtccacacc aaggctttgc ctgcagaaag 480

gctgcactgt ggccagcggg ggaagcggag gagacacctg tggacaccct ccccacagga 540
ttgaaagaag gtggcaagag gaagaagcag aagcagagag aggagtcgaa gaagaagaag 600

tcgaccaaag gcaatcacta gaccggactt gaggcacgcg gtgcaccccc agacgctggc 660 

gctccaccgt gctcggcatg cggtcgtgca cacgcgctag gtagcagcgt cggtcaggac 720

tgtctcgagg ccacactcgc tcggcaggat tatgcgatca cggatcagtc agagcagggt 780 

caggagacgg ggctgacggc acgggtggcg gggacagacg tttgggactt ggccgcgact 840 

ctctgcttct ctccagctct caatctgctg cattttcctc tagtgcttcc ggatcctctt 900 

cattcttttc ggctactcaa ccactccgca tgctgctgga atatttctgg ctttagaagt 960 

acaggagggc gcagatggct aactgagtaa cattcatgaa atgaggcttt ctgtggcggc 1020 

gtagtgtttg gaattagaag gtaattcagt agagtgtaac ttagagaata ttgcaagtga 1080 

cacattgaat cctgcccgtc agggcacctt ttcctcagag caatccggcc acacgaatag 1140 

aaggctgtcg tgaatcacat cagatgtaaa atcattcctt ctgtttactc ttttaatttt 1200 

catcctttgc aggtagtgca aattcaactt caaatatggt gtaggttttg ctagattcca 1260 

tatttttttc ttggattttt gctaattatt tttagcaaaa aatttttgct cagtggcacc 1320 

ctccctagtg tccatgggtt agggccatgc tggggaaaac gggccggtat ttacacacgc 1380 

gcaaaacacc cagagacggc acaaggaggt tgaactcatg tttcagttcg cgaacattga 1440

ctccttacga aagtcacttc attctaacta gatgcgccca cttccggtca ttatttcgtt 1500

tgcatgatgt attgcttctt cacgttttgt ttttattgag cacggagtag aattccaggg 1560 

ctgccttgac ttcttccctg catgctccct cccagtgact ttccttccct ttcacatgag 1620 

gatctgccgt tcatgttgct ttctcctttg tcctcttgga cttgagggca ttgtgaaaag 1680 

ctttgctgtg atttaaaaat gccagcaatt ttaatctagc agtgttgaag ctgggaattt 1740 

tttggcgcaa tccatgtagc agtgacccag gcttgggagc cagaaacaag tgtgacctgg 1800 

gattttattt aacacaactg ttgccaaaga gttggctttg tttatttggt tttggcgggg 1860 

agaggagtgg tatttgatgc tttctgtgga caatgtaacc ctaaacacat catgtatttt 1920

aaatgccacc tacataaata aaacataagc atattgaa 1958 

<210> 108 

<211> 3400 

<212> DNA 

<213> Homo sapien 



<400> 108 

ccgccccgtc gcgccagcgt gcggagaaac ctggcgccgc gaccagtgta tcgggggtct 60

tggaaatggc taagaaagct ggcctctgcc taggaggctc tcggcagggg ggctgccaat 120

ccgggatggt taccggaaat gaaccgcgcg atctggccct ttcacaccca cttagctttg 180
tagggggttt gtagtcttta caggacaaaa gggcacctat tccagagcac accccctttt 240

cttcctctcc cttctgtgcc tcactgcttt ccgatctcat agttgctccc agtaagtgct 300

ccaaagtcgt gtggggccgc atctgaggtc tgcccgcctg aattgggttg gaggggttca 360

tgttctgaga aggcgaaggc cagggtgtag gatggtgccc cgtgactttc cagtctttgt 420

aggcgtgcct ggtatcttca tcgtaatagc agagtcgtca actttctatg aaaggtggtt 480

ttaccttgat aggtggtagt ggtggcctaa tttgtatcat tgtcatgggt ttagtttcca 540

tattctagtt agcagctgct actgttctct tgcaagtgta taatgtgaaa taataaacgt 600

gaagacttct gaataaggtt attctgtgtc tttcatagta gaaaccttaa tgatcggtct 660

gttgtagtga aactctttaa aaaggcgcta tagaaaacca atttctgagt aaaccagcag 720

acagcatgac ttgtaaatgg tcttttaatt aattaaaaag aaattagtca gctacaagca 780

tgaacatgtg gaacgcttac ctttgtacta ggcgtttttg tttttgtttt aatggctttt 840

ggaatattat agtattaaca tctggaaaac taggtaaatt tatcttagaa ttaagttttt 900

tgctcctttt ttgcagaaaa agaacagcaa gaagcgattg aacacattga tgaagtacaa 960

aatgaaatag acagacttaa tgaacaagcc agtgaggaga ttttgaaagt agaacagaaa 1020

tataacaaac tccgccaacc attttttcag aagaggtcag aattgatcgc caaaatccca 1080

aatttttggg taacaacatt tgtcaaccat ccacaagtgt ctgcactgct tggggaggaa 1140

gatgaagagg cactgcatta tttgaccaga gttgaagtga cagaatttga agatattaaa 1200

tcaggttaca gaatagattt ttattttgat gaaaatcctt actttgaaaa taaagttctc 1260

tccaaagaat ttcatctgaa tgagagtggt gatccatctt cgaagtccac cgaaatcaaa 1320

tggaaatctg gaaaggattt gacgaaacgt tcgagtcaaa cgcagaataa agccagcagg 1380

aagaggcagc atgaggaacc agagagcttc tttacctggt ttactgacca ttctgatgca 1440

ggtgctgatg agttaggaga ggtcatcaaa gatgatattt ggccaaaccc attacagtac 1500

tacttggttc ccgatatgga tgatgaagaa ggagaaggag aagaagatga tgatgatgat 1560

gaagaggagg aaggattaga agatattgac gaagaagggg atgaggatga aggtgaagaa 1620

gatgaagatg atgatgaagg ggaggaagga gaggaggatg aaggagaaga tgactaaata 1680

gaacactgat ggattccaac cttccttttt ttaaattttc tccagtccct gggagcaagt 1740

tgcagtcttt tttttttttt tttttttttt ccctcttgtg ctcagtcgcc ctgttcttga 1800

ggtctctttt ctctactcca tggttctcaa tttatttggg gggaaatacc ttgagcagaa 1860

tacaatggga aaagagtctc tacccctttc tgttcgaagt tcatttttat cccttcctgt 1920

ctgaacaaaa actgtatgga atcaacacca ccgagctctg tgggaaaaaa gaaaaacctg 1980
ctcccttcgc tctgctggaa gctggagggt gctaggcccc tgtgtagtag tgcatagaat 2040

tctagctttt ttcctccttt ctctgtatat tgggctcaga gagtacactg tgtctctatg 2100

tgaatatgga cagttagcat ttaccaacat gtatctgtct actttctctt gtttaaaaaa 2160

agaaaaaaaa acttaaaaaa atggggttat agaaggtcag caaagggtgg gtttgagatg 2220

tttgggtggg ttaagtgggc attttgacaa catggcttct cctttggcat gtttaattgt 2280

gatatttgac agacatcctt gcagtttaag atgacacttt taaaataaat tctctcctaa 2340

tgatgacttg agccctgcca ctcaatggga gaatcagcag aacctgtagg atcttatttg 2400

gaattgacat tctctattgt aattttgttc ctgtttattt ttaaattttc tttttgtttc 2460

actggaaagg aaagatgatg ctcagtttta aacgttaaaa gtgtacaagt tgctttgtta 2520

caataaaact aaatgtgtac acaaaggatt tgatgctttt ctctcagcat aggtatgctt 2580

actatgacct tccaagtttg acttgtataa catcactgtc aaactttgtc accctaactt 2640

cgtatttttt gatacgcact ttgcaggatg acctcagggc tatgtggatt gagtaatggg 2700

atttgaatca atgtattaat atctccatag ctgggaaacg tgggttcaat ttgccattgg 2760

tttctgaaag tattcacatc atttgggata ccagatagct caatactctc tgagtacatt 2820

gtgcccttga tttttatctc caagtggcag tttttaaaat tggcctttta cctggatata 2880

aattaattgt gcctgccacc accatccaac agacctggtg ctctaatgcc aagttataca 2940

cgggacagtt gctggcatgt cttcattggc tatataaaat gtggccaaga agataggctc 3000

tcagtaagaa gtctgatggt gagcagtaac tgtccctgct ttctggtata aagctctcaa 3060

atgtgaccat gtgaatctgg gtgggataat ggactcagct ctgtctgctc aatgccattg 3120

tgcagagaag caccctaatg cataagcttt ttaatgctgt aaaatatagt cgctgaaatt 3180

aaatgccact ttttcagagg tgaattaatg gacagtctgg tgaacttcaa aagctttttg 3240

atgtataaaa cttgataaat ggaactattc catcaatagg caaaagtgta acaacctatc 3300

tagatggata gtatgtaatt tctgcacagg tctctgttta gtaaatacat cactgtatac 3360

cgatcaggaa tcttgctcca ataaaggaac ataaagattt 3400



<210> 109 

<211> 2775 

<212> DNA 

<213> Homo sapien 

<400> 109 

ccgccccgtc gcgccagcgt gcggagaaac ctggcgccgc gaccagtgta tcgggggtct 60

tggaaatggc taagaaagct ggcctctgcc taggaggctc tcggcagggg ggctgccaat 120

ccgggatggt taccggaaat gaaccgcgcg atctggccct ttcacaccca cttagctttg 180
tagggggttt gtagtcttta caggacaaaa gggcacctat tccagagcac accccctttt 240

cttcctctcc cttctgtgcc tcactgcttt ccgatctcat agttgctccc aaaaaagaac 300 

agcaagaagc gattgaacac attgatgaag tacaaaatga aatagacaga cttaatgaac 360 

aagccagtga ggagattttg aaagtagaac agaaatataa caaactccgc caaccatttt 420

ttcagaagag gtcagaattg atcgccaaaa tcccaaattt ttgggtaaca acatttgtca 480 

accatccaca agtgtctgca ctgcttgggg aggaagatga agaggcactg cattatttga 540 

ccagagttga agtgacagaa tttgaagata ttaaatcagg ttacagaata gatttttatt 600 

ttgatgaaaa tccttacttt gaaaataaag ttctctccaa agaatttcat ctgaatgaga 660

gtggtgatcc atcttcgaag tccaccgaaa tcaaatggaa atctggaaag gatttgacga 720 
aacgttcgag tcaaacgcag aataaagcca gcaggaagag gcagcatgag gaaccagaga 780
gcttctttac ctggtttact gaccattctg atgcaggtgc tgatgagtta ggagaggtca 840
tcaaagatga tatttggcca aacccattac agtactactt ggttcccgat atggatgatg 900 
aagaaggaga aggagaagaa gatgatgatg atgatgaaga ggaggaagga ttagaagata 960 

ttgacgaaga aggggatgag gatgaaggtg aagaagatga agatgatgat gaaggggagg 1020

aaggagagga ggatgaagga gaagatgact aaatagaaca ctgatggatt ccaaccttcc 1080 

tttttttaaa ttttctccag tccctgggag caagttgcag tctttttttt tttttttttt 1140 

tttttccctc ttgtgctcag tcgccctgtt cttgaggtct cttttctcta ctccatggtt 1200

ctcaatttat ttggggggaa ataccttgag cagaatacaa tgggaaaaga gtctctaccc 1260 

ctttctgttc gaagttcatt tttatccctt cctgtctgaa caaaaactgt atggaatcaa 1320 

caccaccgag ctctgtggga aaaaagaaaa acctgctccc ttcgctctgc tggaagctgg 1380

agggtgctag gcccctgtgt agtagtgcat agaattctag cttttttcct cctttctctg 1440 

tatattgggc tcagagagta cactgtgtct ctatgtgaat atggacagtt agcatttacc 1500 

aacatgtatc tgtctacttt ctcttgttta aaaaaagaaa aaaaaactta aaaaaatggg 1560 

gttatagaag gtcagcaaag ggtgggtttg agatgtttgg gtgggttaag tgggcatttt 1620 

gacaacatgg cttctccttt ggcatgttta attgtgatat ttgacagaca tccttgcagt 1680 

ttaagatgac acttttaaaa taaattctct cctaatgatg acttgagccc tgccactcaa 1740

tgggagaatc agcagaacct gtaggatctt atttggaatt gacattctct attgtaattt 1800 

tgttcctgtt tatttttaaa ttttcttttt gtttcactgg aaaggaaaga tgatgctcag 1860 

ttttaaacgt taaaagtgta caagttgctt tgttacaata aaactaaatg tgtacacaaa 1920 
ggatttgatg cttttctctc agcataggta tgcttactat gaccttccaa gtttgacttg 1980 
tataacatca ctgtcaaact ttgtcaccct aacttcgtat tttttgatac gcactttgca 2040 



ggatgacctc agggctatgt ggattgagta atgggatttg aatcaatgta ttaatatctc 2100

catagctggg aaacgtgggt tcaatttgcc attggtttct gaaagtattc acatcatttg 2160

ggataccaga tagctcaata ctctctgagt acattgtgcc cttgattttt atctccaagt 2220

ggcagttttt aaaattggcc ttttacctgg atataaatta attgtgcctg ccaccaccat 2280

ccaacagacc tggtgctcta atgccaagtt atacacggga cagttgctgg catgtcttca 2340

ttggctatat aaaatgtggc caagaagata ggctctcagt aagaagtctg atggtgagca 2400

gtaactgtcc ctgctttctg gtataaagct ctcaaatgtg accatgtgaa tctgggtggg 2460

ataatggact cagctctgtc tgctcaatgc cattgtgcag agaagcaccc taatgcataa 2520

gctttttaat gctgtaaaat atagtcgctg aaattaaatg ccactttttc agaggtgaat 2580

taatggacag tctggtgaac ttcaaaagct ttttgatgta taaaacttga taaatggaac 2640

tattccatca ataggcaaaa gtgtaacaac ctatctagat ggatagtatg taatttctgc 2700

acaggtctct gtttagtaaa tacatcactg tataccgatc aggaatcttg ctccaataaa 2760

ggaacataaa gattt 2775

<210> 110

<211> 4476

<212> DNA

<213> Homo sapien

<400> 110

catcttgcag ccggcggcgc ggattgaatg agcccgccga gcccgggccg ccgtcgggag 60

cagcgcaggc cgcgagccgc cgccaccatg gccacgccgc tgcccggccg cgcgggcggg 120

cccgccacgc cgctgtcgcc cacgcgcctg tcgcggctgc aggagaagga ggagctgcgc 180

gagctcaacg accgcctggc gcactacatc gaccgcgtcc gcgcgctgga gctggagaac 240

gaccggctcc tgctcaagat ctcagagaag gaggaggtga ccacgcgcga ggtgagtggc 300

atcaaggcgc tgtacgagtc ggagctggcc gatgcccgga gagtcctgga tgagacggct 360

cgagagcgtg cccggctgca gatagagatt gggaagctga gggcagagtt ggacgaggtc 420

aacaagagcg ccaagaagag ggagggcgag cttacggtgg cccagggccg tgtgaaggac 480

ctggagtccc tgttccaccg gagcgaggtg gagctggcag ctgccctcag cgacaagcgc 540

ggcctggaga gtgacgtggc tgagctgcgg gcccagctgg ccaaggccga ggacggtcat 600

gcagtggcca aaaagcagct ggagaaggag acgctgatgc gtgtggacct ggagaaccgc 660

tgccagagcc tgcaggagga gctggacttc cggaagagtg tgttcgagga ggaggtgcgg 720

gagacgcggc ggcggcacga gcggcgcctg gtggaggtgg acagcagccg gcagcaggag 780

tacgacttca agatggcaca ggcgctggag gagctgcgga gccagcacga cgagcaagtg 840
cggctctaca agctggagct ggagcagacc taccaggcca agctggacag cgccaagctg 900

agctctgacc agaacgacaa ggcggccagt gcggctcgcg aggagctgaa ggaggcccgc 960 

atgcgcctgg agtccctcag ctaccagctc tccggcctcc agaagcaggc cagtgccgct 1020 

gaagatcgca ttcgggagct ggaggaggcc atggccgggg agcgggacaa gttccggaag 1080 

atgctggacg ccaaggagca ggagatgacg gagatgcggg acgtgatgca gcagcagctg 1140 

gccgagtacc aggagctgct ggacgtgaag ctggccctgg acatggagat caacgcctac 1200

cggaagctcc tggagggcga ggaggagagg ctgaagctgt cccccagccc atcctcgcgc 1260 

gtcaccgtct cacgagccac ctcgagcagc agcggcagct tgtccgccac cgggcgcctg 1320 

ggccgcagta agcggaagcg gctggaggtg gaggagccct tgggcagcgg cccaagcgtc 1380 

ctgggcacgg gcacgggtgg cagcggtggc ttccacctgg cccagcaggc ctcggcctcg 1440 

ggtagcgtca gcatcgagga gatcgacctg gagggcaagt ttgtgcagct caagaacaac 1500 

tcggacaagg atcagtctct ggggaactgg agaatcaaga ggcaggtctt ggagggggag 1560 

gagatcgcct acaagttcac gcccaagtac atcctgcgcg ccggccagat ggtcacggtg 1620 

tgggcagctg gtgcgggggt ggcccacagc cccccctcga cgctggtgtg gaagggccag 1680 

agcagctggg gcacgggcga gagcttccgc accgtcctgg ttaacgcgga tggcgaggaa 1740 

gtggccatga ggactgtgaa gaagtcctcg gtgatgcgtg agaatgagaa tggggaggaa 1800 

gaggaggagg aagccgagtt tggcgaggag gatcttttcc accaacaggg ggacccgagg 1860 

accacctcaa gaggctgcta cgtgatgtga acccacactc ctcatccaca cacctttctt 1920 

tacccagagc cactgaaaac tatttttata tcattggctt tctttagttc ttgatacatt 1980 

tctagagaat ttctaagcga actgccagaa cgtgtgggtg ggtctccccc agccctccct 2040 

cctggcgggt ctcctccagc ctcacttcgc tgccacttcg ccgctgcccc ggagactttt 2100 

caatcccacc ccactcctca tctcaccatt tggtcaaatt ggaagcccag ggccaggacc 2160 

cggaggttta gaagatgctt gggcttggag ggaggagggc cggcgaggct agcgagggga 2220 

caggagacgg ccctgctgcg gacggagcgc ggaaactgcg taggaattca gtggtggtgg 2280 

gtttttttaa ggctttctac aaaaccaaat tcagaatcca ggcgtcgacc tggtggggcc 2340 

cggggccaag cctgcattct ggctgcccag cttcggacag cgggaactcc tcaggcagcc 2400 

acgcagcggg tgtgggccag catggggatg gcgtggcccc cagggcgggt tttcactccg 2460 

ctgcctgggc ttccagattc ccgttctggc agcgcaccgg ccgggtttct cggaccgttg 2520 

actttatttg ggggagtttt cccgcagttc agttcctgac tgtgcaaggc caacagggca 2580 

ggggagggga agacctgggg aaggaagaat gaggacagtc ccgtcgtaag acctgtcaca 2640
acaataagca gggaggggag atgtggaggg gacacatctg gttgccttgg aggcagaagc 2700

tgtgagtttc agaacagctg tctgcaggga acgccaccat gttgaccctc tggaggagag 2760 

cgctgtggag cccctcccgt gttccagctc cgtctgccct gtgcctatat atacacatgc 2820 

gtctatccat actgtgcttt tatctgtgat tttctcgctg aaaccatgtt tctcagacag 2880 

gccaaggcca cctgactcct atcacgacgc acccaagccc ctcagtccag cttcccaatg 2940 

cctggcaccc cccttcggca atagctcacc gtttacaccc tccctcatag atacacagaa 3000 

gttatttttt taatggatat ttattttttt acattggtca gtacacaggt caggagctca 3060 

cgccagggcc ttgaggacag gctgaccctc ctccccgggg tggcgtgggg ctggggcacc 3120 

ctccgacggc agagcctcct tcagaaagtg cagctcaagt cttaaagaca ccaaaactga 3180 

gccatgggca cgcgccgtct ccgggccatg gcgttcactg cagggcgggg gcggcaccgc 3240 

tccccatgta gatagagggg ggcgcatctt accaaagcat ttcctcctgg aggctacgcc 3300 

gctgtgctcc cagtcaggcg gctggtaggg agctttgcct gccccgggga taccctctgc 3360

cagccgctgg aagtgggaat gctggcgaca gactgtgtcc tctttcccac cttcatagca 3420 

ggaatcaccc ggacccgact ggctgggctt cgtgctagcg agggttctgg gggtgggtct 3480 

tggtgatctt gtcctatggg gagtctgcag tggtctcagc cacatcctat gtattttggc 3540 

tctggaggag caaagctgta tcctggagtt ggtctgtgat ttgccgacag ccttgcaggc 3600 

tgggctcagg gacaaagtcc cccccaaaac ccgcaggtcc tcatgtccag acgctgccca 3660

gtcctgtcct gaaaacagca cgccccaggc ccacagaacc ccccacccta catttgcctt 3720

gggtggagct gggggtggtc ctaggactgc gggtgccctt agctgaaggg ggtggggaga 3780

agcgtggact gggcagcctg tgggtaattg gaggttcatt gagaattgag tctttggaaa 3840

cactaagaaa atcaaatttt taaaagttat ttatggcctg ggaaacaatt tgcatttgtc 3900 

cccaaatacg cttagctgtg tgccgcttag aacgatgcga aaccatccct ctgtgtaagc 3960 

ccgtgccgtg tgactcgaag cctagcgccc tccctgcgaa gcatcagacg ccacccagcc 4020

ctggggggag gcccacgcct gctggaccaa cgcgggttct ggggtgcaca gcgccaggtt 4080 

aacgctgaag cctgccccgc tgagcccagg agccgggagg cctgcgggct gacccagaat 4140

ccgatcatgc acctgtcctc atgccagcgg ctttggctgg ggttggtctg aagcctgcac 4200 

gcggcagttc tttgttaaag atctgaggga ctcctcagtc ctggggcgtc gccgcctgca 4260 

gcctcttcca agccctgcgt ccagcgagcg tcacagcaca acctgcaaaa acggagctgg 4320 

gctgcagctg gggctggcat ggactttcat ttcagagatt cggtttttaa gaagatgcat 4380 

gcctaatgtg ttcttttttt tttccaatga tttgtaatat acattttatg actggaaact 4440 

tttttgtaca acactccaat aaacattttg atttta 4476 
<210> 111

<211> 1887 

<212> DNA 

<213> Homo sapien 

<400> 111 

gcctggtggc gaattcggca cgaggatgag cccgccgagc ccgggccgcc gtcgggagca 60 

gcgcaggccg cgagccgccg ccaccatggc cacgccgctg cccggccgcg cgggcgggcc 120 

cgccacgccg ctgtcgccca cgcgcctgtc gcggctgcag gagaaggagg agctgcgcga 180 

gctcaacgac cgcctggcgc actacatcga ccgcgtccgc gcgctggagc tggagaacga 240 

ccggctcctg ctcaagatct cagagaagga ggaggtgacc acgcgcgagg tgagtggcat 300 

caaggcgctg tacgagtcgg agctggccga tgcccggaga gtcctggatg agacggctcg 360 

agagcgtgcc cggctgcaga tagagattgg gaagctgagg gcagagttgg acgaggtcaa 420 

caagagcgcc aagaagaggg agggcgagct tacggtggcc cagggccgtg tgaaggacct 480 

ggagtccctg ttccaccgga gcgaggtgga gctggcagct gccctcagcg acaagcgcgg 540

cctggagagt gacgtggctg agctgcgggc ccagctggcc aaggccgagg acggtcatgc 600 

agtggccaaa aagcagctgg agaagggctg tccttgctct cagaagtccc ggtcccatgt 660 

agatagaggg gggcgcatct taccaaagca tttcctcctg gaggctacgc cgctgtgctc 720 

ccagtcaggc ggctggtagg gagctttgcc tgccccgggg ataccctctg ccagccgctg 780 

gaagtgggaa tgctggcgac agactgtgtc ctctttccca ccttcatagc aggaatcacc 840

cggacccgac tggctgggct tcgtgctagc gagggttctg ggggtgggtc ttggtgatct 900 

tgtcctatgg ggagtctgca gtggtctcag ccacatccta tgtattttgg ctctggagga 960

gcaaagctgt atcctggagt tggtctgtga tttgccgaca gccttgcagg ctgggctcag 1020

ggacaaagtc ccccccaaaa cccgcaggtc ctcatgtcca gacgctgccc agtcctgtcc 1080 

tgaaaacagc acgccccagg cccacagaac cccccaccct acatttgcct tgggtggagc 1140

tgggggtggt cctaggactg cgggtgccct tagctgaagg gggtggggag aagcgtggac 1200

tgggcagcct gtgggtaatt ggaggttcat tgagaattga gtctttggaa acactaagaa 1260

aatcaaattt ttaaaagtta tttatggcct gggaaacaat ttgcatttgt ccccaaatac 1320 

gcttagctgt gtgccgctta gaacgatgcg aaaccatccc tctgtgtaag cccgtgccgt 1380

gtgactcgaa gcctagcgcc ctccctgcga agcatcagac gccacccagc cctgggggga 1440 

ggcccacgcc tgctggacca acgcgggttc tggggtgcac agcgccaggt taacgctgaa 1500 

gcctgccccg ctgagcccag gagccgggag gcctgcgggc tgacccagaa tccgatcatg 1560 

cacctgtcct catgccagcg gctttggctg gggttggtct gaagcctgca cgcggcagtt 1620
ctttgttaaa gatctgaggg actcctcagt cctggggcgt cgccgcctgc agcctcttcc 1680

aagccctgcg tccagcgagc gtcacagcac aacctgcaaa aacggagctg ggctgcagct 1740 

ggggctggca tggactttca tttcagagat tcggttttta agaagatgca tgcctaatgt 1800 

gttctttttt ttttccaatg atttgtaata tacattttat gactggaaac ttttttgtac 1860 

aacactccaa taaacatttt gatttta 1887

<210> 112 

<211> 4041 

<212> DNA 

<213> Homo sapien 

<400> 112 

catcttgcag ccggcggcgc ggattgaatg agcccgccga gcccgggccg ccgtcgggag 60 

cagcgcaggc cgcgagccgc cgccaccatg gccacgccgc tgcccggccg cgcgggcggg 120 

cccgccacgc cgctgtcgcc cacgcgcctg tcgcggctgc aggagaagga ggagctgcgc 180 

gagctcaacg accgcctggc gcactacatc gaccgcgtcc gcgcgctgga gctggagaac 240 

gaccggctcc tgctcaagat ctcagagaag gaggaggtga ccacgcgcga ggtgagtggc 300

atcaaggcgc tgtacgagtc ggagctggcc gatgcccgga gagtcctgga tgagacggct 360 

cgagagcgtg cccggctgca gatagagatt gggaagctga gggcagagtt ggacgaggtc 420

aacaagagcg ccaagaagag ggagggcgag cttacggtgg cccagggccg tgtgaaggac 480 

ctggagtccc tgttccaccg gagcgaggtg gagctggcag ctgccctcag cgacaagcgc 540

ggcctggaga gtgacgtggc tgagctgcgg gcccagctgg ccaaggccga ggacggtcat 600 

gcagtggcca aaaagcagct ggagaaggag acgctgatgc gtgtggacct ggagaaccgc 660 

tgccagagcc tgcaggagga gctggacttc cggaagagtg tgttcgagga ggaggtgcgg 720 

gagacgcggc ggcggcacga gcggcgcctg gtggaggtgg acagcagccg gcagcaggag 780 

tacgacttca agatggcaca ggcgctggag gagctgcgga gccagcacga cgagcaagtg 840

cggctctaca agctggagct ggagcagacc taccaggcca agctggacag cgccaagctg 900

agctctgacc agaacgacaa ggcggccagt gcggctcgcg aggagctgaa ggaggcccgc 960 

atgcgcctgg agtccctcag ctaccagctc tccggcctcc agaagcaggc cagtgccgct 1020 

gaagatcgca ttcgggagct ggaggaggcc atggccgggg agcgggacaa gttccggaag 1080

atgctggacg ccaaggagca ggagatgacg gagatgcggg acgtgatgca gcagcagctg 1140 

gccgagtacc aggagctgct ggacgtgaag ctggccctgg acatggagat caacgcctac 1200 

cggaagctcc tggagggcga ggaggagagg ctgaagctgt cccccagccc atcctcgcgc 1260 

gtcaccgtct cacgagccac ctcgagcagc agcggcagct tgtccgccac cgggcgcctg 1320 
ggccgcagta agcggaagcg gctggaggtg gaggagccct tgggcagcgg cccaagcgtc 1380

ctgggcacgg gcacgggtgg cagcggtggc ttccacctgg cccagcaggc ctcggcctcg 1440

ggtagcgtca gcatcgagga gatcgacctg gagggcaagt ttgtgcagct caagaacaac 1500

tcggacaagg atcagtctct ggggaactgg agaatcaaga ggcaggtctt ggagggggag 1560

gagatcgcct acaagttcac gcccaagtac atcctgcgcg ccggccagat ggtcacggtg 1620

tgggcagctg gtgcgggggt ggcccacagc cccccctcga cgctggtgtg gaagggccag 1680

agcagctggg gcacgggcga gagcttccgc accgtcctgg ttaacgcgga tggcgaggaa 1740

gtggccatga ggactgtgaa gaagtcctcg gtgatgcgtg agaatgagaa tggggaggaa 1800

gaggaggagg aagccgagtt tggcgaggag gatcttttcc accaacaggg ggacccgagg 1860

accacctcaa gaggctgcta cgtgatgtga acccacactc ctcatccaca cacctttctt 1920

tacccagagc cactgaaaac tatttttata tcattggctt tctttagttc ttgatacatt 1980

tctagagaat ttctaagcga actgccagaa cgtgtgggtg ggtctccccc agccctccct 2040

cctggcgggt ctcctccagc ctcacttcgc tgccacttcg ccgctgcccc ggagactttt 2100

caatcccacc ccactcctca tctcaccatt tggtcaaatt ggaagcccag ggccaggacc 2160

cggaggttta gaagatgctt gggcttggag ggaggagggc cggcgaggct agcgagggga 2220

caggagacgg ccctgctgcg gacggagcgc ggaaactgcg taggaattca gtggtggtgg 2280

gtttttttaa ggctttctac aaaaccaaat tcagaatcca ggcgtcgacc tggtggggcc 2340

cggggccaag cctgcattct ggctgcccag cttcggacag cgggaactcc tcaggcagcc 2400

acgcagcggg tgtgggccag catggggatg gcgtggcccc cagggcgggt tttcactccg 2460

ctgcctgggc ttccagattc ccgttctggc agcgcaccgg ccgggtttct cggaccgttg 2520

actttatttg ggggagtttt cccgcagttc agttcctgac tgtgcaaggc caacagggca 2580

ggggagggga agacctgggg aaggaagaat gaggacagtc ccgtcgtaag acctgtcaca 2640

acaataagca gggaggggag atgtggaggg gacacatctg gttgccttgg aggcagaagc 2700

tgtgagtttc agaacagctg tctgcaggga acgccaccat gttgaccctc tggaggagag 2760

cgctgtggag cccctcccgt gttccagctc cgtctgccct gtgcctatat atacacatgc 2820

gtctatccat actgtgcttt tatctgtgat tttctcgctg aaaccatgtt tctcagacag 2880

gccaaggcca cctgactcct atcacgacgc acccaagccc ctcagtccag cttcccaatg 2940

cctggcaccc cccttcggca atagctcacc gtttacaccc tccctcatag atacacagaa 3000

gttatttttt taatggatat ttattttttt acattggtca gtacacaggt caggagctca 3060

cgccagggcc ttgaggacag gctgaccctc ctccccgggg tggcgtgggg ctggggcacc 3120
ctccgacggc agagcctcct tcagaaagtg cagctcaagt cttaaagaca ccaaaactga 3180

gccatgggca cgcgccgtct ccgggccatg gcgttcactg cagggcgggg gcggcaccgc 3240 

tcccctgtga ctgcatcccg cctccctggg gacctgcctg tggcaggaag gaatgggggg 3300 

ccccagcccc aggccgggaa ggagccagcg gccgacaaag cagaaacacc cgctgctcca 3360 

cgtagcccct gctcgctgtc cttgctctca gaagtcccgg tcccatgtag atagaggggg 3420 

gcgcatctta ccaaagcatt tcctcctgga ggctacgccg ctgtgctccc agtcaggcgg 3480 

ctggtaggga gctttgcctg ccccggggat accctctgcc agccgctgga agtgggaatg 3540 

ctggcgacag actgtgtcct ctttcccacc ttcatagcag gaatcacccg gacccgactg 3600 

gctgggcttc gtgctagcga gggttctggg ggtgggtctt ggtgatcttg tcctatgggg 3660

agtctgcagt ggtctcagcc acatcctatg tattttggct ctggaggagc aaagctgtat 3720 

cctggagttg gtctgtgatt tgccgacagc cttgcaggct gggctcaggg acaaagtccc 3780 

ccccaaaacc cgcaggtcct cattcctggg gcgtcgccgc ctgcagcctc ttccaagccc 3840

tgcgtccagc gagcgtcaca gcacaacctg caaaaacgga gctgggctgc agctggggct 3900 

ggcatggact ttcatttcag agattcggtt tttaagaaga tgcatgccta atgtgttctt 3960 

ttttttttcc aatgatttgt aatatacatt ttatgactgg aaactttttt gtacaacact 4020 

ccaataaaca ttttgatttt a 4041 

<210> 113 

<211> 165 

<212> PRT 

<213> Homo sapien 

<400> 113 

Gly Gly Arg Asn Arg Gly Arg Ala Gly Ser Gln Gly Gly Arg Gly Gly 
1 5 10 15 

Gly Ala Gln Ala Ala Ala Arg Val Asn Arg Gly Gly Gly Pro Ile Arg 
20 25 30 

Asn Arg Pro Ala Ile Ala Arg Gly Ala Ala Gly Gly Gly Gly Arg Asn 
35 40 45 

Arg Pro Ala Pro Tyr Ser Arg Pro Lys Gln Leu Pro Asp Lys Trp Gln 
50 55 60 

His Asp Leu Phe Asp Ser Gly Phe Gly Gly Gly Ala Gly Val Glu Thr 
65 70 75 80 



Gly Gly Lys Leu Leu Val Ser Asn Leu Asp Phe Gly Val Ser Asp Ala 
85 90 95



Asp Ile Gln Glu Leu Phe Ala Glu Phe Gly Thr Leu Lys Lys Ala Ala
100 105 110 



Val His Tyr Asp Arg Ser Gly Arg Ser Leu Gly Thr Ala Asp Val His 
115 120 125 



Phe Glu Arg Lys Ala Asp Ala Leu Lys Ala Met Lys Gln Tyr Asn Gly
130 135 140 



Val Pro Leu Asp Ala Ser Tyr Ile Pro Pro Leu Leu Gln Leu Leu Pro
145 150 155 160 



Glu Asp Ser Leu Leu 
165 



<210> 114 

<211> 164 

<212> PRT 

<213> Homo sapien 

<220> 

<221> MISC_FEATURE

<222> (4)..(4)

<223> X=any amino acid 



<400> 114 

Gly Ala Gly Xaa Ala Pro Gly Arg Leu Gln Gly Gly Arg Gly Gly Gly
1 5 10 15



Ala Gln Ala Ala Ala Arg Val Asn Arg Gly Gly Gly Pro Ile Arg Asn
20 25 30 



Arg Pro Ala Ile Ala Arg Gly Ala Ala Gly Gly Gly Gly Arg Asn Arg
35 40 45 



Pro Ala Pro Tyr Ser Arg Pro Lys Gln Leu Pro Asp Lys Trp Gln His
50 55 60 



Asp Leu Phe Asp Ser Gly Phe Gly Gly Gly Ala Gly Val Glu Thr Gly 
65 70 75 80 



Gly Lys Leu Leu Val Ser Asn Leu Asp Phe Gly Val Ser Asp Ala Asp 
85 90 95 



Ile Gln Glu Leu Phe Ala Glu Phe Gly Thr Leu Lys Lys Ala Ala Val
100 105 110



His Tyr Asp Arg Ser Gly Arg Ser Leu Gly Thr Ala Asp Val His Phe 
115 120 125 



Glu Arg Lys Ala Asp Ala Leu Lys Ala Met Lys Gln Tyr Asn Gly Val
130 135 140 



Pro Leu Asp Ala Ser Tyr Ile Pro Pro Leu Leu Gln Leu Leu Pro Glu
145 150 155 160 



Asp Ser Leu Leu 



<210> 115 

<211> 256 

<212> PRT 

<213> Homo sapien 

<400> 115 

Met Ala Leu Arg Val Thr Arg Asn Ser Lys Ile Asn Ala Glu Asn Lys
1 5 10 15



Ala Lys He Asn Met Ala Gly Ala Lys Arg Val Pro Thr Ala Pro Ala 
20 25 30 



Ala Thr Ser Lys Pro Gly Leu Arg Pro Arg Thr Ala Leu Gly Asp Ile
35 40 45



Gly Asn Lys Val Ser Glu Gln Leu Gln Ala Lys Met Pro Met Lys Lys
50 55 60



Glu Ala Lys Pro Ser Ala Thr Gly Lys Val Ile Asp Lys Lys Leu Pro
65 70 75 80



Lys Pro Leu Glu Lys Val Pro Met Leu Val Pro Val Pro Val Ser Glu 
85 90 95 



Pro Val Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro Val Lys Glu 
100 105 110



Glu Lys Leu Ser Pro Glu Pro He Leu Val Asp Thr Ala Ser Pro Ser 
115 120 125 



Pro Met Glu Thr Ser Gly Cys Ala Pro Ala Glu Glu Asp Leu Cys Gin 
130 135 140 
Ala Phe Ser Asp Val Ile Leu Ala Val Asn Asp Val Asp Ala Glu Asp
145 150 155 160 



Gly Ala Asp Pro Asn Leu Cys Ser Glu Tyr Val Lys Asp lie Tyr Ala 
165 170 175 



Tyr Leu Arg Gin Leu Glu Glu Glu Gin Ala Val Arg Pro Lys Tyr Leu 
180 185 190 



Leu Gly Arg Glu Val Thr Gly Asn Met Arg Ala lie Leu lie Asp Trp 
195 200 205 



Leu Val Gin Val Gin Met Lys Ser Val Cys Ala Gly Pro Val Cys Gly 
210 215 220 



Pro He Asp Gly Pro Ala Lys Leu Gly Ala Gin He Ala Gly Gly Pro 
225 230 235 240 



Ala Val Trp Pro Leu Lys Gly Pro Arg Gly Arg Trp Gly Thr Leu Ala 
245 250 255 



<210> 116 

<211> 250 

<212> PRT 

<213> Homo sapien 

<400> 116 

Ala Gly Ser Ser Arg Arg Ala Ala Ala Glu Arg Leu Leu Val Ser Ala 
1 5 10 15 



Gly Cys Arg Ser Leu Ala Gly Arg Ala Ser Gly Val Leu Leu Leu Pro 
20 25 30 



Ala Glu Leu Leu Pro Gly Glu Glu Glu Ala Met Ala Leu Arg Val Thr 
35 40 45 



Arg Asn Ser Lys He Asn Ala Glu Asn Lys Ala Lys He Asn Met Ala 
50 55 60 



Gly Ala Lys Arg Val Pro Thr Ala Pro Ala Ala Thr Ser Lys Pro Gly 
65 70 75 80 



Leu Arg Pro Arg Thr Ala Leu Gly Asp He Gly Asn Lys Val Ser Glu 
85 90 95 



Gln Leu Gln Ala Lys Met Pro Met Lys Lys Glu Ala Lys Pro Ser Ala
100 105 110

Thr Gly Lys Val He Asp Lys Lys Leu Pro Lys Pro Leu Glu Lys Val 
115 120 125 

Pro Met Leu Val Pro Val Pro Val Ser Glu Pro Val Pro Glu Pro Glu 
130 135 140 

Pro Glu Pro Glu Pro Glu Pro Val Lys Glu Glu Lys Leu Ser Pro Glu 
145 150 155 160 

Pro He Leu Val Asp Thr Ala Ser Pro Ser Pro Met Glu Thr Ser Gly 
165 170 175 

Cys Ala Pro Ala Glu Glu Asp Leu Cys Gin Ala Phe Ser Asp Val He 
180 185 190 

Leu Ala Val Asn Asp Val Asp Ala Glu Asp Gly Ala Asp Pro Asn Leu 
195 200 205 

Cys Ser Glu Tyr Val Lys Asp Ile Tyr Ala Tyr Leu Arg Gln Leu Glu
210 215 220

Glu Glu Gin Ala Val Arg Pro Lys Tyr Leu Leu Gly Arg Glu Val Thr 
225 230 235 240 



Gly Asn Met Arg Ala He Leu He Asp Trp 
245 250 



<210> 117 

<211> 406 

<212> PRT 

<213> Homo sapien 

<400> 117 

Met Glu Ala Ala Ala Val Thr Val Thr Arg Ser Ala Thr Arg Arg Arg 
1 5 10 15

Arg Arg Gin Leu Gin Gly Leu Ala Ala Pro Glu Ala Gly Thr Gin Glu 
20 25 30 



Glu Gln Glu Asp Gln Glu Pro Arg Pro Arg Arg Arg Arg Pro Gly Arg
35 40 45



Ser He Lys Asp Glu Glu Glu Glu Thr Val Phe Arg Glu Val Val Ser 
50 55 60 
Phe Ser Pro Asp Pro Leu Pro Val Arg Tyr Tyr Asp Lys Asp Thr Thr
65 70 75 80 



Lys Pro lie Ser Phe Tyr Leu Ser Ser Leu Glu Glu Leu Leu Ala Trp 
85 90 95 



Lys Pro Arg Leu Glu Asp Gly Phe Asn Val Ala Leu Glu Pro Leu Ala 
100 105 110 



Cys Arg Gin Pro Pro Leu Ser Ser Gin Arg Pro Arg Thr Leu Leu Cys 
115 120 125 



His Asp Met Met Gly Gly Tyr Leu Asp Asp Arg Phe lie Gin Gly Ser 
130 135 140 



Val Val Gin Thr Pro Tyr Ala Phe Tyr His Trp Gin Cys lie Asp Val 
145 150 155 160 



Phe Val Tyr Phe Ser His His Thr Val Thr lie Pro Pro Val Gly Trp 
165 170 175 



Thr Asn Thr Ala His Arg His Gly Val Cys Val Leu Gly Thr Phe lie 
180 185 190 



Thr Glu Trp Asn Glu Gly Gly Arg Leu Cys Glu Ala Phe Leu Ala Gly 
195 200 205 



Asp Glu Arg Ser Tyr Gln Ala Val Ala Asp Arg Leu Val Gln Ile Thr
210 215 220



Gin Phe Phe Arg Phe Asp Gly Trp Leu lie Asn lie Glu Asn Ser Leu 
225 230 235 240 



Ser Leu Ala Ala Val Gly Asn Met Pro Pro Phe Leu Arg Tyr Leu Thr 
245 250 255 



Thr Gin Leu His Arg Gin Val Pro Gly Gly Leu Val Leu Trp Tyr Asp 
260 265 270 



Ser Val Val Gin Ser Gly Gin Leu Lys Trp Gin Asp Glu Leu Asn Gin 
275 280 285 



His Asn Arg Val Phe Phe Asp Ser Cys Asp Gly Phe Phe Thr Asn Tyr 
290 295 300 
Asn Trp Arg Glu Glu His Leu Glu Arg Met Leu Gly Gln Ala Gly Glu
305 310 315 320

Arg Arg Ala Asp Val Tyr Val Gly Val Asp Val Phe Ala Arg Gly Asn 
325 330 335 



Val Val Gly Gly Arg Phe Asp Thr Asp Lys Val Gly Gly Gly Phe Arg 
340 345 350 



Pro Arg Ala Ser Gly Pro Val Pro Pro Leu Gly Pro His Phe Leu Met 
355 360 365 



Asp Leu Pro Phe Pro Ser Ala Pro Gin Arg Asn Asp Ser Ser Cys Ser 
370 375 380 



Ser Gin Ser Gly Asp Pro Val Ala Leu Arg Asn Arg Cys Pro Ala Pro 
385 390 395 400 



Ala Lys Leu Cys Pro His 
405 



<210> 118 

<211> 525 

<212> PRT 

<213> Homo sapien 

<400> 118 

Met Glu Ala Ala Ala Val Thr Val Thr Arg Ser Ala Thr Arg Arg Arg 
1 5 10 15



Arg Arg Gln Leu Gln Gly Leu Ala Ala Pro Glu Ala Gly Thr Gln Glu
20 25 30



Glu Gin Glu Asp Gin Glu Pro Arg Pro Arg Arg Arg Arg Pro Gly Arg 
35 40 45 



Ser He Lys Asp Glu Glu Glu Glu Thr Val Phe Arg Glu Val Val Ser 
50 55 60 



Phe Ser Pro Asp Pro Leu Pro Val Arg Tyr Tyr Asp Lys Asp Thr Thr 
65 70 75 80 



Lys Pro He Ser Phe Tyr Leu Ser Ser Leu Glu Glu Leu Leu Ala Trp 
85 90 95 



Lys Pro Arg Leu Glu Asp Gly Phe Asn Val Ala Leu Glu Pro Leu Ala 
100 105 110 
Cys Arg Gln Pro Pro Leu Ser Ser Gln Arg Pro Arg Thr Leu Leu Cys
115 120 125 



His Asp Met Met Gly Gly Tyr Leu Asp Asp Arg Phe He Gin Gly Ser 
130 135 140 



Val Val Gin Thr Pro Tyr Ala Phe Tyr His Trp Gin Cys He Asp Val 
145 150 155 160 



Phe Val Tyr Phe Ser His His Thr Val Thr He Pro Pro Val Gly Trp 
165 170 175 



Thr Asn Thr Ala His Arg His Gly Val Cys Val Leu Gly Thr Phe He 
180 185 190 



Thr Glu Trp Asn Glu Gly Gly Arg Leu Cys Glu Ala Phe Leu Ala Gly 
195 200 205 



Asp Glu Arg Ser Tyr Gin Ala Val Ala Asp Arg Leu Val Gin He Thr 
210 215 220 



Gln Phe Phe Arg Phe Asp Gly Trp Leu Ile Asn Ile Glu Asn Ser Leu
225 230 235 240



Ser Leu Ala Ala Val Gly Asn Met Pro Pro Phe Leu Arg Tyr Leu Thr 
245 250 255 



Thr Gin Leu His Arg Gin Val Pro Gly Gly Leu Val Leu Trp Tyr Asp 
260 265 270 



Ser Val Val Gin Ser Gly Gin Leu Lys Trp Gin Asp Glu Leu Asn Gin 
275 280 285 



His Asn Arg Val Phe Phe Asp Ser Cys Asp Gly Phe Phe Thr Asn Tyr 
290 295 300



Asn Trp Arg Glu Glu His Leu Glu Arg Met Leu Gly Gln Ala Gly Glu
305 310 315 320



Arg Arg Ala Asp Val Tyr Val Gly Val Asp Val Phe Ala Arg Gly Asn 
325 330 335 



Val Val Gly Gly Arg Phe Asp Thr Asp Lys Ser Leu Glu Leu He Arg 
340 345 350 
Lys Hie Gly Phe Ser Val Ala Leu Phe Ala Pro Gly Trp Val Tyr Glu
355 360 365 

Cys Leu Glu Lys Lys Asp Phe Phe Gin Asn Gin Asp Lys Phe Trp Gly 
370 375 380 

Arg Leu Glu Arg Tyr Leu Pro Thr His Ser He Cys Ser Leu Pro Phe 
385 390 395 400 

Val Thr Ser Phe Cys Leu Gly Met Gly Ala Arg Arg Val Cys Tyr Gly 
405 410 415 



Gin Glu Glu Ala Val Gly Pro Trp Tyr His Leu Ser Ala Gin Glu He 
420 425 430 



Gin Pro Leu Phe Gly Glu His Arg Leu Gly Gly Asp Gly Arg Gly Trp 
435 440 445 



Val Arg Thr His Cys Cys Leu Glu Asp Ala Trp His Gly Gly Ser Ser 
450 455 460 



Leu Leu Val Arg Gly Val He Pro Pro Glu Val Gly Asn Val Ala Val 
465 470 475 480 



Arg Trp Val Ser Asp Gly Gly Arg Trp Ala His Gin Leu Leu Pro Ser 
485 490 495 



His Val Val Ala Met Glu Leu Asp Arg Trp Gly Ser Gly Gly Gin Asn 
500 505 510 



Lys Asp Arg Gly Gin Thr Gin Met Gly Phe Leu Lys Leu 
515 520 525 



<210> 119 

<211> 327 

<212> PRT 

<213> Homo sapien 

<400> 119 

Met Phe Gin Pro Thr Lys Glu Ser Gly Leu Gly Gly Gly Leu Val Pro 
1 5 10 15 



Trp Leu Arg Thr Gly Pro Arg Cys Gly Ser He Trp His Pro Gly Pro 
20 25 30 



Leu Phe Leu Glu Gly Gly Ala Gly Gly Arg Asp Leu Glu Leu Ala Ser 
35 40 45



lie Ser Gin Trp Ser Leu His Gly Thr His His Arg Thr Phe Phe Pro 
50 55 60 

Arg Leu Phe Ser Leu Gin Ala Pro Val Pro Pro Lys lie Tyr Leu Ser 
65 70 75 80 

Met Val Tyr Lys Leu Glu Gly Pro Thr Asp Val Thr Val Ala Leu Glu 
85 90 95 



Leu Thr Thr Gly Asp Ala Gly Ser Cys His lie Gly Gly lie Ser Val 
100 105 110 



Leu Asn Ala Glu Thr Ser Ser Arg His Ser Leu Arg Pro Leu Arg Val 
115 120 125 



Pro Pro Thr Lys Leu Ala Arg Trp Val Gly Arg Cys Gly Arg Gin Leu 
130 135 140 



Ser Gly Gly Trp Val Gln His Cys Tyr Glu Val Ser Leu Arg Gly Cys
145 150 155 160 



Leu Leu Leu Asp Leu Leu Val Cys Phe Ser Arg Pro Pro Gly Ser Arg 
165 170 175 



Glu Glu Glu Ser Phe Thr Cys Arg Leu Gly Glu lie Gin Val Val Asp 
180 185 190 



Ala Ala Ser Leu Leu Ala Pro Leu Pro Gin Val Gin Ala Val Thr He 
195 200 205 



Ser His He Arg Trp Gin Pro Ser Ala Ser Glu Arg Glu Gly Pro Pro 
210 215 220 



Ala Leu Leu Gin Leu Ser Cys Thr Leu His Trp Ser Phe Leu Leu Ser 
225 230 235 240 



Gin Val Arg Cys Phe Arg He His Cys Trp Gly Gly Met Ser Asp Asp 
245 250 255 



Ser Pro Gly Arg Glu Leu Pro Arg Pro Glu Met Pro Met Phe Leu Gly 
260 265 270 



Leu Ala Phe Ala Thr Gin Tyr Arg He Val Asp Leu Leu Val Glu Ala 
275 280 285 
Ala Gly Pro Gly Gln Asp Arg Arg Met Glu Phe Leu Val Glu Pro Val
290 295 300 

Pro Lys Glu Gly Phe Arg Val Pro Gin Ala Glu Trp Gly Arg Ala Val 
305 310 315 320 



Leu Leu Tyr Ser Ala Pro Ala 
325 



<210> 120 

<211> 384 

<212> PRT 

<213> Homo sapien 

<400> 120 

Gln Ile Pro Arg Thr Val Ser Ser Cys Arg Thr Gly Leu Ser Pro Leu
1 5 10 15



His lie Ser Pro Pro Ser Ser Pro Ser Pro Pro Lys Pro Pro Leu Tyr 
20 25 30 



Ser Ala Ser Val Ser Leu Asp Thr Leu Asp Ala Pro Tyr Glu Gly lie 
35 40 45 



Pro Tyr Gly lie Ser Glu Leu Arg Cys Phe Ser Pro Gin Lys Asn Leu 
50 55 60 



Ala Leu Gly Glu Asp Leu Ser Pro Gly Tyr Gly Gin Asp His Asp Val 
65 70 75 80 



Gly Ala Phe Gly Thr Gin Ala Pro Cys Ser Trp Arg Glu Gly Leu Val 
85 90 95 



Asp Ala lie Trp Ser Trp Leu Arg Phe Leu Ser Gly Leu Ser Thr Ala 
100 105 110 



Pro Ile Thr Gly Pro Phe Ser Pro Gly Tyr Phe Tyr Ser Leu Gln Ala
115 120 125



Pro Val Pro Pro Lys lie Tyr Leu Ser Met Val Tyr Lys Leu Glu Gly 
130 135 140 



Pro Thr Asp Val Thr Val Ala Leu Glu Leu Thr Thr Gly Asp Ala Gly 
145 150 155 160 
Ser Cys His Ile Gly Gly Ile Ser Val Leu Asn Ala Glu Thr Ser Ser
165 170 175 



Arg His Ser Leu Arg Pro Leu Arg Val Pro Pro Thr Lys Leu Ala Arg 
180 185 190 



Trp Val Gly Arg Cys Gly Arg Gin Leu Ser Gly Gly Trp Val Gin His 
195 200 205 



Cys Tyr Glu Val Ser Leu Arg Gly Cys Leu Leu Leu Asp Leu Leu Val 
210 215 220 



Cys Phe Ser Arg Pro Pro Gly Ser Arg Glu Glu Glu Ser Phe Thr Cys 
225 230 235 240 



Arg Leu Gly Glu He Gin Val Val Asp Ala Ala Ser Leu Leu Ala Pro 
245 250 255 



Leu Pro Gin Val Gin Ala Val Thr He Ser His He Arg Trp Gin Pro 
260 265 270 



Ser Ala Ser Glu Arg Glu Gly Pro Pro Ala Leu Leu Gin Leu Ser Cys 
275 280 285 



Thr Leu His Trp Ser Phe Leu Leu Ser Gin Val Arg Cys Phe Arg He 
290 295 300 



His Cys Trp Gly Gly Met Ser Asp Asp Ser Pro Gly Arg Glu Leu Pro 
305 310 315 320 



Arg Pro Glu Met Pro Met Phe Leu Gly Leu Ala Phe Ala Thr Gin Tyr 
325 330 335 



Arg He Val Asp Leu Leu Val Glu Ala Ala Gly Pro Gly Gin Asp Arg 
340 345 350 



Arg Met Glu Phe Leu Val Glu Pro Val Pro Lys Glu Gly Phe Arg Val 
355 360 365 



Pro Gin Ala Glu Trp Gly Arg Ala Val Leu Leu Tyr Ser Ala Pro Ala 
370 375 380 



<210> 121 

<211> 450 

<212> PRT 

<213> Homo sapien 
<400> 121

Gln Ile Pro Arg Thr Val Ser Ser Cys Arg Thr Gly Leu Ser Pro Leu
1 5 10 15



His lie Ser Pro Pro Ser Ser Pro Ser Pro Pro Lys Pro Pro Leu Tyr 
20 25 30 



Ser Ala Ser Val Ser Leu Asp Thr Leu Asp Ala Pro Tyr Glu Gly lie 
35 40 45 



Pro Tyr Gly lie Ser Glu Leu Arg Cys Phe Ser Pro Gin Lys Asn Leu 
50 55 60 



Ala Leu Gly Glu Asp Leu Ser Pro Gly Tyr Gly Gin Asp His Asp Val 
65 70 75 80 



Gly Ala Phe Gly Thr Gin Ala Pro Cys Ser Trp Arg Glu Gly Leu Val 
85 90 95 



Asp Ala lie Trp Ser Trp Leu Arg Phe Leu Ser Gly Leu Ser Thr Ala 
100 105 110 



Pro lie Thr Gly Pro Phe Ser Pro Gly Tyr Phe Tyr Ser Leu Gin Ala 
115 120 125 



Pro Val Pro Pro Lys He Tyr Leu Ser Met Val Tyr Lys Leu Glu Gly 
130 135 140 



Pro Thr Asp Val Thr Val Ala Leu Glu Leu Thr Thr Gly Asp Ala Gly 
145 150 155 160 



Ser Cys His He Gly Gly He Ser Val Leu Asn Ala Glu Thr Ser Ser 
165 170 175 



Arg His Ser Leu Arg Pro Leu Arg Val Pro Pro Thr Lys Leu Ala Arg 
180 185 190 



Trp Val Gly Arg Cys Gly Arg Gin Leu Ser Gly Gly Trp Val Gin His 
195 200 205 



Cys Tyr Glu Val Ser Leu Arg Gly Cys Leu Leu Leu Asp Leu Leu Val 
210 215 220 



Cys Phe Ser Arg Pro Pro Gly Ser Arg Glu Glu Glu Ser Phe Thr Cys 
225 230 235 240 
Arg Leu Gly Glu Ile Gln Val Met Leu Pro Arg Gly Ala Arg Ala Gly
245 250 255 



Leu Ala Val Cys Pro Ala Gly Val Gly Val Glu Ala Ala Pro Gly Arg 
260 265 270 



Pro Leu Leu Gly Phe Ser Gly Glu Leu Gly Trp Arg Ser Gin Gly Gly 
275 280 285 



Glu Met Cys Ala Trp Gly His Pro Leu Pro Ala Pro Gly Arg Pro Ala 
290 295 300 



Val Leu Ser Leu Leu Ser Cys Gin Val Val Asp Ala Ala Ser Leu Leu 
305 310 315 320 



Ala Pro Leu Pro Gin Val Gin Ala Val Thr lie Ser His lie Arg Trp 
325 330 335 



Gin Pro Ser Ala Ser Glu Arg Glu Gly Pro Pro Ala Leu Leu Gin Leu 
340 345 350 



Ser Cys Thr Leu His Trp Ser Phe Leu Leu Ser Gin Val Arg Cys Phe 
355 360 365 



Arg lie His Cys Trp Gly Gly Met Ser Asp Asp Ser Pro Gly Arg Glu 
370 375 380 



Leu Pro Arg Pro Glu Met Pro Met Phe Leu Gly Leu Ala Phe Ala Thr 
385 390 395 400 



Gin Tyr Arg lie Val Asp Leu Leu Val Glu Ala Ala Gly Pro Gly Gin 
405 410 415 



Asp Arg Arg Met Glu Phe Leu Val Glu Pro Val Pro Lys Glu Gly Phe 
420 425 430 



Arg Val Pro Gin Ala Glu Trp Gly Arg Ala Val Leu Leu Tyr Ser Ala 
435 440 445 



Pro Ala 
450 



<210> 122 

<211> 302 

<212> PRT 

<213> Homo sapien 
<400> 122



Met Glu Ala Ala Ala Val Thr Val Thr Arg Ser Ala Thr Arg Arg Arg 
1 5 10 15



Arg Arg Gin Leu Gin Gly Leu Ala Ala Pro Glu Ala Gly Thr Gin Glu 
20 25 30 



Glu Gin Glu Asp Gin Glu Pro Arg Pro Arg Arg Arg Arg Pro Gly Arg 
35 40 45 



Ser lie Lys Asp Glu Glu Glu Glu Thr Val Phe Arg Glu Val Val Ser 
50 55 60 



Phe Ser Pro Asp Pro Leu Pro Val Arg Tyr Tyr Asp Lys Asp Thr Thr 
65 70 75 80 



Lys Pro He Ser Phe Tyr Leu Ser Ser Leu Glu Glu Leu Leu Ala Trp 
85 90 95 



Lys Pro Arg Leu Glu Asp Gly Phe Asn Val Ala Leu Glu Pro Leu Ala 
100 105 110 



Cys Arg Gin Pro Pro Leu Ser Ser Gin Arg Pro Arg Thr Leu Leu Cys 
115 120 125 



His Asp Met Met Gly Gly Tyr Leu Asp Asp Arg Phe He Gin Gly Ser 
130 135 140 



Val Val Gin Thr Pro Tyr Ala Phe Tyr His Trp Gin Cys He Asp Val 
145 150 155 160 



Phe Val Tyr Phe Ser His His Thr Val Thr He Pro Pro Val Gly Trp 
165 170 175 



Thr Asn Thr Ala His Arg His Gly Val Cys Val Leu Gly Thr Phe He 
180 185 190 



Thr Glu Trp Asn Glu Gly Gly Arg Leu Cys Glu Ala Phe Leu Ala Gly 
195 200 205 



Asp Glu Arg Ser Tyr Gin Ala Val Ala Asp Arg Leu Val Gin He Thr 
210 215 220 



Gin Phe Phe Arg Phe Asp Gly Trp Leu He Asn He Glu Asn Ser Leu 
225 230 235 240 
Ser Leu Ala Ala Val Gly Asn Met Pro Pro Phe Leu Arg Tyr Leu Thr
245 250 255 



Thr Gin Leu Leu Val Glu Ala Ala Gly Pro Gly Gin Asp Arg Arg Met 
260 265 270 



Glu Phe Leu Val Glu Pro Val Pro Lys Glu Gly Phe Arg Val Pro Gin 
275 280 285 



Ala Glu Trp Gly Arg Ala Val Leu Leu Tyr Ser Ala Pro Ala 
290 295 300 



<210> 123 

<211> 162 

<212> PRT 

<213> Homo sapien 

<400> 123 

Met Pro Arg Trp Tyr Phe Leu Leu Ala Arg Cys Phe Gly Cys Ala Val 
1 5 10 15



He Glu Asp Thr Trp His Tyr Phe Leu His Arg Leu Leu His His Lys 
20 25 30 



Arg He Tyr Lys Tyr He His Lys Val His His Glu Phe Gin Ala Pro 
35 40 45 



Phe Gly Met Glu Ala Glu Tyr Ala His Pro Leu Glu Thr Leu He Leu 
50 55 60 



Gly Thr Gly Phe Phe He Gly He Val Leu Leu Cys Asp His Val He 
65 70 75 80 



Leu Leu Trp Ala Trp Val Thr He Arg Leu Leu Glu Thr He Asp Val 
85 90 95 



His Ser Gly Tyr Asp He Pro Leu Asn Pro Leu Asn Leu He Pro Phe 
100 105 110 



Tyr Ala Gly Ser Arg His His Asp Phe His His Met Asn Phe He Gly 
115 120 125 



Asn Tyr Ala Ser Thr Phe Thr Trp Trp Asp Arg He Phe Gly Thr Asp 
130 135 140 
Ser Gln Tyr Asn Ala Tyr Asn Glu Lys Arg Lys Lys Phe Glu Lys Lys
145 150 155 160 



Thr Glu 



<210> 124 

<211> 206 

<212> PRT 

<213> Homo sapien 

<400> 124 

Met Gly Glu Pro Gin Gly Ser Met Arg lie Leu Val Thr Gly Gly Ser 
1 5 10 15 

Gly Leu Val Gly Lys Ala He Gin Lys Val Val Ala Asp Gly Ala Gly 
20 25 30 



Leu Pro Gly Glu Asp Trp Val Phe Val Ser Ser Lys Asp Ala Asp Leu 
35 40 45 

Thr Asp Thr Ala Gin Thr Arg Ala Leu Phe Glu Lys Val Gin Pro Thr 
50 55 60 

His Val He His Leu Ala Ala Met Val Gly Gly Leu Phe Arg Asn He 
65 70 75 80 



Lys Tyr Asn Leu Asp Phe Trp Arg Lys Asn Val His Met Asn Asp Asn 
85 90 95 



Val Leu His Ser Ala Phe Glu Val Gly Ala Arg Lys Val Val Ser Cys 
100 105 110

Leu Ser Thr Cys lie Phe Pro Asp Lys Thr Thr Tyr Pro He Asp Glu 
115 120 125 



Thr Met He His Asn Gly Pro Pro His Asn Ser Asn Phe Gly Tyr Ser 
130 135 140 

Tyr Ala Lys Arg Met Ile Asp Val Gln Asn Ser Ala Gly Pro Thr Ser
145 150 155 160



Ser Ser Thr Ala Ala Pro Ser Pro Leu Ser Ser Pro Pro Thr Ser Ser 
165 170 175 



Gly Pro Thr Thr Thr Ser Thr Ser Arg Met Ala Thr Cys Cys Leu Ala 
180 185 190 
Ser Ser Thr Arg Cys Thr Trp Pro Arg Ala Ala Ala Arg Pro
195 200 205



<210> 125 

<211> 380 

<212> PRT 

<213> Homo sapien 

<400> 125 

Leu Gin Val Pro Ala Val Pro Gly Thr Leu Arg Ala Pro Gly Thr Pro 
1 5 10 15 



Phe Pro Arg Val Pro Arg Pro Ser Leu Pro Ala Pro Pro Pro Thr Trp 
20 25 30 



Leu Arg Gly Gin Pro Glu Arg Thr Arg Pro Glu Ala Ala Val Gly Glu 
35 40 45 



Pro Ala Val Gly Leu Asp Ala Gly Ala Thr Asp Met Gly Glu Pro Gin 
50 55 60 



Gly Ser Met Arg lie Leu Val Thr Gly Gly Ser Gly Leu Val Gly Lys 
65 70 75 80 



Ala lie Gin Lys Val Val Ala Asp Gly Ala Gly Leu Pro Gly Glu Asp 
85 90 95 



Trp Val Phe Val Ser Ser Lys Asp Ala Asp Leu Thr Asp Thr Ala Gin 
100 105 110 



Thr Arg Ala Leu Phe Glu Lys Val Gin Pro Thr His Val lie His Leu 
115 120 125 



Ala Ala Met Val Gly Gly Leu Phe Arg Asn lie Lys Tyr Asn Leu Asp 
130 135 140 



Phe Trp Arg Lys Asn Val His Met Asn Asp Asn Val Leu His Ser Ala 
145 150 155 160 



Phe Glu Val Gly Ala Arg Lys Val Val Ser Cys Leu Ser Thr Cys lie 
165 170 175 



Phe Pro Asp Lys Thr Thr Tyr Pro He Asp Glu Thr Met He His Asn 
180 185 190 
Gly Pro Pro His Asn Ser Asn Phe Gly Tyr Ser Tyr Ala Lys Arg Met
195 200 205 



lie Asp Val Gin Asn Arg Ala Tyr Phe Gin Gin Tyr Gly Cys Thr Phe 
210 215 220 



Thr Ala Val lie Pro Thr Asn Val Phe Gly Pro His Asp Asn Phe Asn 
225 230 235 240 



He Glu Asp Gly His Val Leu Pro Gly Leu He His Lys Val His Leu 
245 250 255 



Ala Lys Ser Ser Gly Ser Ala Leu Thr Val Trp Gly Thr Gly Asn Pro 
260 265 270 



Arg Arg Gin Phe He Tyr Ser Leu Asp Leu Ala Gin Leu Phe He Trp 
275 280 285 



Val Leu Arg Glu Tyr Asn Glu Val Glu Pro He He Leu Ser Val Gly 
290 295 300 



Glu Glu Asp Glu Val Ser He Lys Glu Ala Ala Glu Ala Val Val Glu 
305 310 315 320 



Ala Met Asp Phe His Gly Glu Val Thr Phe Asp Thr Thr Lys Ser Asp 
325 330 335 



Gly Gin Phe Lys Lys Thr Ala Ser Asn Ser Lys Leu Arg Thr Tyr Leu 
340 345 350 



Pro Asp Phe Arg Phe Thr Pro Phe Lys Gin Ala Val Lys Glu Thr Cys 
355 360 365 



Ala Trp Phe Thr Asp Asn Tyr Glu Gin Ala Arg Lys 
370 375 380 



<210> 126 

<211> 380 

<212> PRT 

<213> Homo sapien 

<400> 126 

Leu Gln Val Pro Ala Val Pro Gly Thr Leu Arg Ala Pro Gly Thr Pro
1 5 10 15



Phe Pro Arg Val Pro Arg Pro Ser Leu Pro Ala Pro Pro Pro Thr Trp 
20 25 30 
Leu Arg Gly Gln Pro Glu Arg Thr Arg Pro Glu Ala Ala Val Gly Glu
35 40 45 



Pro Ala Val Gly Leu Asp Ala Gly Ala Thr Asp Met Gly Glu Pro Gin 
50 55 60 



Gly Ser Met Arg lie Leu Val Thr Gly Gly Ser Gly Leu Val Gly Lys 
65 70 75 80 



Ala lie Gin Lys Val Val Ala Asp Gly Ala Gly Leu Pro Gly Glu Asp 
85 90 95 



Trp Val Phe Val Ser Ser Lys Asp Ala Asp Leu Thr Asp Thr Ala Gin 
100 105 110 



Thr Arg Ala Leu Phe Glu Lys Val Gin Pro Thr His Val lie His Leu 
115 120 125 



Ala Ala Met Val Gly Gly Leu Phe Arg Asn lie Lys Tyr Asn Leu Asp 
130 135 140 



Phe Trp Arg Lys Asn Val His Met Asn Asp Asn Val Leu His Ser Ala 
145 150 155 160 



Phe Glu Val Gly Ala Arg Lys Val Val Ser Cys Leu Ser Thr Cys lie 
165 170 175 



Phe Pro Asp Lys Thr Thr Tyr Pro lie Asp Glu Thr Met lie His Asn 
180 185 190 



Gly Pro Pro His Asn Ser Asn Phe Gly Tyr Ser Tyr Ala Lys Arg Met
195 200 205 



lie Asp Val Gin Asn Arg Ala Tyr Phe Gin Gin Tyr Gly Cys Thr Phe 
210 215 220 



Thr Ala Val He Pro Thr Asn Val Phe Gly Pro His Asp Asn Phe Asn 
225 230 235 240 



He Glu Asp Gly His Val Leu Pro Gly Leu He His Lys Val His Leu 
245 250 255 



Ala Lys Ser Ser Gly Ser Ala Leu Thr Val Trp Gly Thr Gly Asn Pro 
260 265 270 
Arg Arg Gln Phe Ile Tyr Ser Leu Asp Leu Ala Gln Leu Phe Ile Trp
275 280 285 

Val Leu Arg Glu Tyr Asn Glu Val Glu Pro He He Leu Ser Val Gly 
290 295 300 

Glu Glu Asp Glu Val Ser He Lys Glu Ala Ala Glu Ala Val Val Glu 
305 310 315 320 

Ala Met Asp Phe His Gly Glu Val Thr Phe Asp Thr Thr Lys Ser Asp 
325 330 335 

Gly Gin Phe Lys Lys Thr Ala Ser Asn Ser Lys Leu Arg Thr Tyr Leu 
340 345 350 

Pro Asp Phe Arg Phe Thr Pro Phe Lys Gin Ala Val Lys Glu Thr Cys 
355 360 365 

Ala Trp Phe Thr Asp Asn Tyr Glu Gin Ala Arg Lys 
370 375 380 



<210> 127 

<211> 334 

<212> PRT 

<213> Homo sapien 

<400> 127 

Met Arg Ala Leu Ala Ala Asn Arg Val Asn Asp Leu Cys Gln Glu Pro
1 5 10 15

Pro Ser Gin Gly Cys Leu Pro Pro Pro Leu Val Ser Gin Arg Gly Val 
20 25 30 

Glu Cys Thr Phe Ser Arg Pro Ser Gly Glu Ser Trp Val Gly Thr Ser 
35 40 45 

Cys Ser Gly Leu Gly Gly Ser Ser Gly Pro Leu Arg Arg Cys Arg Leu 
50 55 60 

Arg Ala Pro Arg Gly Thr Gly Leu Arg Arg Gly Ser Ala Ser Val Gin 
65 70 75 80 

Leu Gly Leu Ser Gly Cys Gin Trp Thr Met Pro His Ser Glu Gly Leu 
85 90 95 

Thr Leu Cys Gin Leu Pro Gin Lys Ser Gly Ala Pro Lys Asp Glu Ser 
100 105 110



Val Met Thr Ser Ala Ser His Cys Leu Thr Leu Gin Gly Ala Thr Asp 
115 120 125 



Met Gly Glu Pro Gin Gly Ser Met Arg lie Leu Val Thr Gly Gly Ser 
130 135 140 



Gly Leu Val Gly Lys Ala lie Gin Lys Val Val Ala Asp Gly Ala Gly 
145 150 155 160 



Leu Pro Gly Glu Asp Trp Val Phe Val Ser Ser Lys Asp Ala Asp Leu 
165 170 175 



Thr Asp Thr Ala Gin Thr Arg Ala Leu Phe Glu Lys Val Gin Pro Thr 
180 185 190 



His Val lie His Leu Ala Ala Met Val Gly Gly Leu Phe Arg Asn lie 
195 200 205 



Lys Tyr Asn Leu Asp Phe Trp Arg Lys Asn Val His Met Asn Asp Asn 
210 215 220 



Val Leu His Ser Ala Phe Glu Val Gly Ala Arg Lys Val Val Ser Cys 
225 230 235 240 



Leu Ser Thr Cys lie Phe Pro Asp Lys Thr Thr Tyr Pro lie Asp Glu 
245 250 255 



Thr Met lie His Asn Gly Pro Pro His Asn Ser Asn Phe Gly Tyr Ser 
260 265 270 



Tyr Ala Lys Arg Met lie Asp Val Gin Asn Ser Ala Gly Pro Thr Ser 
275 280 285 



Ser Ser Thr Ala Ala Pro Ser Pro Leu Ser Ser Pro Pro Thr Ser Ser 
290 295 300 



Gly Pro Thr Thr Thr Ser Thr Ser Arg Met Ala Thr Cys Cys Leu Ala 
305 310 315 320 



Ser Ser Thr Arg Cys Thr Trp Pro Arg Ala Ala Ala Arg Pro 
325 330 



<210> 128

<211> 327

<212> PRT

<213> Homo sapien 

<400> 128 

His Tyr Ser Ala Thr Asp Met Gly Glu Pro Gin Gly Ser Met Arg He 
1 5 10 15 

Leu Val Thr Gly Gly Ser Gly Leu Val Gly Lys Ala Ile Gln Lys Val
20 25 30 

Val Ala Asp Gly Ala Gly Leu Pro Gly Glu Asp Trp Val Phe Val Ser 
35 40 45 

Ser Lys Asp Ala Asp Leu Thr Asp Thr Ala Gin Thr Arg Ala Leu Phe 
50 55 60 

Glu Lys Val Gin Pro Thr His Val He His Leu Ala Ala Met Val Gly 
65 70 75 80 

Gly Leu Phe Arg Asn He Lys Tyr Asn Leu Asp Phe Trp Arg Lys Asn 
85 90 95 

Val His Met Asn Asp Asn Val Leu His Ser Ala Phe Glu Val Gly Ala 
100 105 110

Arg Lys Val Val Ser Cys Leu Ser Thr Cys He Phe Pro Asp Lys Thr 
115 120 125 

Thr Tyr Pro He Asp Glu Thr Met He His Asn Gly Pro Pro His Asn 
130 135 140 

Ser Asn Phe Gly Tyr Ser Tyr Ala Lys Arg Met He Asp Val Gin Asn 
145 150 155 160 

Arg Ala Tyr Phe Gin Gin Tyr Gly Cys Thr Phe Thr Ala Val He Pro 
165 170 175 

Thr Asn Val Phe Gly Pro His Asp Asn Phe Asn He Glu Asp Gly His 
180 185 190 

Val Leu Pro Gly Leu He His Lys Val His Leu Ala Lys Ser Ser Gly 
195 200 205 

Ser Ala Leu Thr Val Trp Gly Thr Gly Asn Pro Arg Arg Gin Phe He 
210 215 220 
Tyr Ser Leu Asp Leu Ala Gln Leu Phe Ile Trp Val Leu Arg Glu Tyr
225 230 235 240 



Asn Glu Val Glu Pro lie lie Leu Ser Val Gly Glu Glu Asp Glu Val 
245 250 255 



Ser He Lys Glu Ala Ala Glu Ala Val Val Glu Ala Met Asp Phe His 
260 265 270 



Gly Glu Val Thr Phe Asp Thr Thr Lys Ser Asp Gly Gin Phe Lys Lys 
275 280 285 



Thr Ala Ser Asn Ser Lys Leu Arg Thr Tyr Leu Pro Asp Phe Arg Phe 
290 295 300 



Thr Pro Phe Lys Gin Ala Val Lys Glu Thr Cys Ala Trp Phe Thr Asp 
305 310 315 320 



Asn Tyr Glu Gin Ala Arg Lys 
325 



<210> 129 

<211> 161 

<212> PRT 

<213> Homo sapien 

<400> 129 

Met Gly Glu Pro Gln Gly Ser Met Arg Ile Leu Val Thr Gly Gly Ser
1 5 10 15



Gly Leu Val Gly Lys Ala He Gin Lys Val Val Ala Asp Gly Ala Gly 
20 25 30 



Leu Pro Gly Glu Asp Trp Val Phe Val Ser Ser Lys Asp Ala Asp Leu 
35 40 45 



Thr Asp Thr Ala Gin Thr Arg Ala Leu Phe Glu Lys Val Gin Pro Thr 
50 55 60 



His Val He His Leu Ala Ala Met Val Gly Gly Leu Phe Arg Asn He 
65 70 75 80 



Lys Tyr Asn Leu Asp Phe Trp Arg Lys Asn Val His Met Asn Asp Asn 
85 90 95 



Val Leu His Ser Ala Phe Glu Val Gly Ala Arg Lys Val Val Ser Cys 
100 105 110 
Leu Ser Thr Cys Ile Phe Pro Asp Lys Thr Thr Tyr Pro Ile Asp Glu
115 120 125 



Thr Met lie His Asn Gly Pro Pro His Asn Ser Asn Phe Gly Tyr Ser 
130 135 140 



Tyr Ala Lys Arg Met lie Asp Val Gin Asn Arg Ser Ser Arg Pro Ser 
145 150 155 160 



Cys 



<210> 130 

<211> 326 

<212> PRT 

<213> Homo sapien 

<400> 130 

Leu Gln Val Pro Ala Val Pro Gly Thr Leu Arg Ala Pro Gly Thr Pro
1 5 10 15



Phe Pro Arg Val Pro Arg Pro Ser Leu Pro Ala Pro Pro Pro Thr Trp 
20 25 30 



Leu Arg Gly Gin Pro Glu Arg Thr Arg Pro Glu Ala Ala Val Gly Glu 
35 40 45 



Pro Ala Val Gly Leu Asp Ala Gly Ala Thr Asp Met Gly Glu Pro Gin 
50 55 60 



Gly Ser Met Arg lie Leu Val Thr Gly Gly Ser Gly Leu Val Gly Lys 
65 70 75 80 



Ala lie Gin Lys Val Val Ala Asp Gly Ala Gly Leu Pro Gly Glu Asp 
85 90 95 



Trp Val Phe Val Ser Ser Lys Asp Ala Asp Leu Thr Asp Thr Ala Gin 
100 105 110 



Thr Arg Ala Leu Phe Glu Lys Val Gin Pro Thr His Val lie His Leu 
115 120 125 



Ala Ala Met Val Gly Gly Leu Phe Arg Asn lie Lys Tyr Asn Leu Asp 
130 135 140 
Phe Trp Arg Lys Asn Val His Met Asn Asp Asn Val Leu His Ser Ala
145 150 155 160



Phe Glu Val Gly Ala Arg Lys Val Val Ser Cys Leu Ser Thr Cys lie 
165 170 175 



Phe Pro Asp Lys Thr Thr Tyr Pro lie Asp Glu Thr Met lie His Asn 
180 185 190 



Gly Pro Pro His Asn Ser Asn Phe Gly Tyr Ser Tyr Ala Lys Arg Met 
195 200 205 



lie Asp Val Gin Asn Arg Ala Tyr Phe Gin Gin Tyr Gly Cys Thr Phe 
210 215 220 



Thr Ala Val lie Pro Thr Asn Val Phe Gly Pro His Asp Asn Phe Asn 
225 230 235 240 



lie Glu Asp Gly His Val Leu Pro Gly Leu lie His Lys Val His Leu 
245 250 255 



Ala Lys Ser Ser Gly Ser Ala Leu Thr Val Trp Gly Thr Gly Asn Pro 
260 265 270 



Arg Arg Gin Phe He Tyr Ser Leu Asp Leu Ala Gin Leu Phe He Trp 
275 280 285 



Val Leu Arg Glu Tyr Asn Glu Val Glu Pro He He Leu Ser Gly Gly 
290 295 300 



Tyr Leu Ser Pro Gin Pro Pro Ser Ser Met Val Gly Gin Asp Pro Arg 
305 310 315 320 



Leu Ser Trp Glu Ala Gly 
325 



<210> 131 

<211> 216 

<212> PRT 

<213> Homo sapien 

<400> 131 

Met Gin Val Arg Thr Asp Pro Arg Ser Arg Gin Cys Trp Pro Leu Glu 
1 5 10 15 



His Arg Thr Trp Leu Thr Asp Ser His Ser Ser Cys Leu Phe Pro Leu 
20 25 30 
Pro Leu Glu Gln Pro Ser Leu Leu Gln Ser Asn Pro Cys Pro Ser Phe
35 40 45 



Leu Pro Leu Ser Arg Ala Ala Pro Pro Ala His Leu Arg Pro Gly Pro 
50 55 60 



Ser Tyr Leu Leu Pro Leu Leu Ser Cys Pro lie Pro Val Val Arg Arg 
65 70 75 80 



Glu Ser Thr Gly Gin Arg Pro Ser Ser Thr Cys Asp Leu Gly Glu Cys 
85 90 95 



Gin Ala Ser Pro Arg Gly Pro Gly Pro Arg Gly Pro Gly Arg Leu Cys 
100 105 110 



Cys Gly Gly Ser Arg Val Arg Thr Gly Ala Ala Ser Pro Leu Ala Val 
115 120 125



Cys Leu Cys Pro Leu His Trp Pro Leu Glu Ala Gin Arg Pro Ser Gly 
130 135 140 



Tyr Phe Pro Ser Ser Gly Leu Pro Leu Met Leu Phe Pro Ala Pro Phe 
145 150 155 160 



Phe Tyr Leu Glu Thr Pro lie Pro Ser His Pro Leu Gin Arg Ser Ser 
165 170 175 



Gin Ser Cys Pro Gin His Gly Ser Leu His Ser Pro Trp Val Ser Pro 
180 185 190 



Pro Val Ser Cys Leu Pro Arg Thr Pro Asp Leu Pro Leu Pro Gly Trp 
195 200 205 



Pro Arg Trp lie Leu Tyr Ser Asp 
210 215 



<210> 132 

<211> 108 

<212> PRT 

<213> Homo sapien 

<400> 132



Met Ala His Ala Thr Leu Ser Ala Ala Pro Ser Asn Pro Arg Leu Leu 
1 5 10 15
Arg Val Ala Leu Leu Leu Leu Leu Leu Val Ala Ala Ser Arg Arg Ala
20 25 30 



Ala Gly Gly Ser Arg Arg Pro Gly Val Pro Gly Pro Asp Ala Ala Gly 
35 40 45 



Val Gly Ala Pro Arg Arg Thr Ala Pro Leu Asn Gin Arg Val Tyr Ser 
50 55 60 



Ser Leu Gly Ala Ser Val Val Thr Glu Leu Arg Cys Gin Cys Leu Gin 
65 70 75 80 



Thr Leu Gin Gly lie His Leu Lys Asn lie Gin Ser Val Asn Val Arg 
85 90 95 



Ser Pro Gly Pro His Cys Ala Gin Thr Glu Val Met 
100 105 



<210> 133 

<211> 142 

<212> PRT 

<213> Homo sapien 

<400> 133 

Lys Gly Ser Pro Ile Leu Gly Ser His Thr Ala Arg Val Ala Gly Thr
1 5 10 15

Ser Pro Pro Ala Leu Pro Leu Leu Ala Gln Leu Pro Asp Ala Ser Ala
20 25 30

Glu Pro His Gly Pro Arg His Ala Leu Arg Arg Pro Gln Gln Ser Pro
35 40 45

Ala Pro Ala Gly Gly Ala Ala Ala Pro Ala Pro Gly Gly Arg Gln Pro
50 55 60

Ala Arg Ser Arg Trp Val Pro Ala Pro Trp Gly Pro Arg Ala Gly Arg
65 70 75 80

Gly Trp Gly Gly Arg Pro Ala Pro Thr Ala Pro Leu Asn Gln Arg Val
85 90 95

Tyr Ser Ser Leu Gly Ala Ser Val Val Thr Glu Leu Arg Cys Gln Cys
100 105 110

Leu Gln Thr Leu Gln Gly Ile His Leu Lys Asn Ile Gln Ser Val Asn
115 120 125

Val Arg Ser Pro Gly Pro His Cys Ala Gln Thr Glu Val Met
130 135 140



<210> 134 

<211> 482 

<212> PRT 

<213> Homo sapien 

<400> 134 

Met Val Met Glu Lys Pro Ser Pro Leu Leu Val Gly Arg Glu Phe Val
1 5 10 15

Arg Gln Tyr Tyr Thr Leu Leu Asn Lys Ala Pro Glu Tyr Leu His Arg
20 25 30

Phe Tyr Gly Arg Asn Ser Ser Tyr Val His Gly Gly Val Asp Ala Ser
35 40 45

Gly Lys Pro Gln Glu Ala Val Tyr Gly Gln Asn Asp Ile His His Lys
50 55 60

Val Leu Ser Leu Asn Phe Ser Glu Cys His Thr Lys Ile Arg His Val
65 70 75 80

Asp Ala His Ala Thr Leu Ser Asp Gly Val Val Val Gln Val Met Gly
85 90 95

Leu Leu Ser Asn Ser Gly Gln Pro Glu Arg Lys Phe Met Gln Thr Phe
100 105 110

Val Leu Ala Pro Glu Gly Ser Val Pro Asn Lys Phe Tyr Val His Asn
115 120 125

Asp Met Phe Arg Tyr Glu Asp Glu Val Phe Gly Asp Ser Glu Pro Glu
130 135 140

Leu Asp Glu Glu Ser Glu Asp Glu Val Glu Glu Glu Gln Glu Glu Arg
145 150 155 160

Gln Pro Ser Pro Glu Pro Val Gln Glu Asn Ala Asn Ser Gly Tyr Tyr
165 170 175

Glu Ala His Pro Val Thr Asn Gly Ile Glu Glu Pro Leu Glu Glu Ser
180 185 190

Ser His Glu Pro Glu Pro Glu Pro Glu Ser Glu Thr Lys Thr Glu Glu
195 200 205

Leu Lys Pro Gln Val Glu Glu Lys Asn Leu Glu Glu Leu Glu Glu Lys
210 215 220

Ser Thr Thr Pro Pro Pro Ala Glu Pro Val Ser Leu Pro Gln Glu Pro
225 230 235 240

Pro Lys Ala Phe Ser Trp Ala Ser Val Thr Ser Lys Asn Leu Pro Pro
245 250 255

Ser Gly Thr Val Ser Ser Ser Gly Ile His Pro Met Leu Lys His Gln
260 265 270

Ser His Ser Gln Glu Ser Lys Leu Asn Gln Lys Phe Asn Leu Ser His
275 280 285

Leu Val Cys Val Asn Asn Asp Leu Glu Asn Asp Leu Val Phe Leu Leu
290 295 300

Glu Asp Gln Asp Gln Ala Glu Glu Ile Met Glu Gln Asn Asp Ser Asp
305 310 315 320

Asn Arg Arg Ile Ile Arg Tyr Pro Asp Ser His Gln Leu Phe Val Gly
325 330 335

Asn Leu Pro His Asp Ile Asp Glu Asn Glu Leu Lys Glu Phe Phe Met
340 345 350

Ser Phe Gly Asn Val Val Glu Leu Arg Ile Asn Thr Lys Gly Val Gly
355 360 365

Gly Lys Leu Pro Asn Phe Gly Phe Val Val Phe Asp Asp Ser Glu Pro
370 375 380

Val Gln Arg Ile Leu Ile Ala Lys Pro Ile Met Phe Arg Gly Glu Val
385 390 395 400

Arg Leu Asn Val Glu Glu Lys Lys Thr Arg Ala Ala Arg Glu Arg Glu
405 410 415

Thr Arg Gly Gly Gly Asp Asp Arg Arg Asp Ile Arg Arg Asn Asp Arg
420 425 430

Gly Pro Gly Gly Pro Arg Gly Ile Val Gly Gly Gly Met Met Arg Asp
435 440 445

Arg Asp Gly Arg Gly Pro Pro Pro Arg Gly Gly Met Ala Gln Lys Leu
450 455 460

Gly Ser Gly Arg Gly Thr Gly Gln Met Glu Gly Arg Phe Thr Gly Gln
465 470 475 480

Arg Arg



<210> 135 

<211> 392 

<212> PRT 

<213> Homo sapien 

<400> 135 

Leu Ser Arg Ser Trp Val Cys Cys Leu Thr Val Asp Asn Gln Lys Glu
1 5 10 15

Ser Leu Cys Lys Pro Leu Phe Trp Leu Leu Lys Asp Leu Phe Gln Ile
20 25 30

Asn Phe Met Phe Thr Met Ile Cys Phe Val Met Lys Met Lys Cys Tyr
35 40 45

Gly Asp Ser Glu Pro Glu Leu Asp Glu Glu Ser Glu Asp Glu Val Glu
50 55 60

Glu Glu Gln Glu Glu Arg Gln Pro Ser Pro Glu Pro Val Gln Glu Asn
65 70 75 80

Ala Asn Ser Gly Tyr Tyr Glu Ala His Pro Val Thr Asn Gly Ile Glu
85 90 95

Glu Pro Leu Glu Glu Ser Ser His Glu Pro Glu Pro Glu Pro Glu Ser
100 105 110

Glu Thr Lys Thr Glu Glu Leu Lys Pro Gln Val Glu Glu Lys Asn Leu
115 120 125

Glu Glu Leu Glu Glu Lys Ser Thr Thr Pro Pro Pro Ala Glu Pro Val
130 135 140

Ser Leu Pro Gln Glu Pro Pro Lys Ala Phe Ser Trp Ala Ser Val Thr
145 150 155 160

Ser Lys Asn Leu Pro Pro Ser Gly Thr Val Ser Ser Ser Gly Ile Pro
165 170 175

Pro His Val Lys Ala Pro Val Ser Gln Pro Arg Val Glu Ala Lys Pro
180 185 190

Glu Val Gln Ser Gln Pro Pro Arg Val Arg Glu Gln Arg Pro Arg Glu
195 200 205

Arg Pro Gly Phe Pro Pro Arg Gly Pro Arg Pro Gly Arg Gly Asp Met
210 215 220

Glu Gln Asn Asp Ser Asp Asn Arg Arg Ile Ile Arg Tyr Pro Asp Ser
225 230 235 240

His Gln Leu Phe Val Gly Asn Leu Pro His Asp Ile Asp Glu Asn Glu
245 250 255

Leu Lys Glu Phe Phe Met Ser Phe Gly Asn Val Val Glu Leu Arg Ile
260 265 270

Asn Thr Lys Gly Val Gly Gly Lys Leu Pro Asn Phe Gly Phe Val Val
275 280 285

Phe Asp Asp Ser Glu Pro Val Gln Arg Ile Leu Ile Ala Lys Pro Ile
290 295 300

Met Phe Arg Gly Glu Val Arg Leu Asn Val Glu Glu Lys Lys Thr Arg
305 310 315 320

Ala Ala Arg Glu Arg Glu Thr Arg Gly Gly Gly Asp Asp Arg Arg Asp
325 330 335

Ile Arg Arg Asn Asp Arg Gly Pro Gly Gly Pro Arg Gly Ile Val Gly
340 345 350

Gly Gly Met Met Arg Asp Arg Asp Gly Arg Gly Pro Pro Pro Arg Gly
355 360 365

Gly Met Ala Gln Lys Leu Gly Ser Gly Arg Gly Thr Gly Gln Met Glu
370 375 380

Gly Arg Phe Thr Gly Gln Arg Arg
385 390

<210> 136
<211> 316 
<212> PRT 
<213> Homo sapien 

<400> 136 

Asp Trp Glu Glu Lys Arg Val Leu Ala Ile Cys Leu Ala Ser Gln Ser
1 5 10 15

Glu Asp Glu Val Glu Glu Glu Gln Glu Glu Arg Gln Pro Ser Pro Glu
20 25 30

Pro Val Gln Glu Asn Ala Asn Ser Gly Tyr Tyr Glu Ala His Pro Val
35 40 45

Thr Asn Gly Ile Glu Glu Pro Leu Glu Glu Ser Ser His Glu Pro Glu
50 55 60

Pro Glu Pro Glu Ser Glu Thr Lys Thr Glu Glu Leu Lys Pro Gln Val
65 70 75 80

Glu Glu Lys Asn Leu Glu Glu Leu Glu Glu Lys Ser Thr Thr Pro Pro
85 90 95

Pro Ala Glu Pro Val Ser Leu Pro Gln Glu Pro Pro Lys Pro Arg Val
100 105 110

Glu Ala Lys Pro Glu Val Gln Ser Gln Pro Pro Arg Val Arg Glu Gln
115 120 125

Arg Pro Arg Glu Arg Pro Gly Phe Pro Pro Arg Gly Pro Arg Pro Gly
130 135 140

Arg Gly Asp Met Glu Gln Asn Asp Ser Asp Asn Arg Arg Ile Ile Arg
145 150 155 160

Tyr Pro Asp Ser His Gln Leu Phe Val Gly Asn Leu Pro His Asp Ile
165 170 175

Asp Glu Asn Glu Leu Lys Glu Phe Phe Met Ser Phe Gly Asn Val Val
180 185 190

Glu Leu Arg Ile Asn Thr Lys Gly Val Gly Gly Lys Leu Pro Asn Phe
195 200 205

Gly Phe Val Val Phe Asp Asp Ser Glu Pro Val Gln Arg Ile Leu Ile
210 215 220




Ala Lys Pro Ile Met Phe Arg Gly Glu Val Arg Leu Asn Val Glu Glu
225 230 235 240

Lys Lys Thr Arg Ala Ala Arg Glu Arg Glu Thr Arg Gly Gly Gly Asp
245 250 255

Asp Arg Arg Asp Ile Arg Arg Asn Asp Arg Gly Pro Gly Gly Pro Arg
260 265 270

Gly Ile Val Gly Gly Gly Met Met Arg Asp Arg Asp Gly Arg Gly Pro
275 280 285

Pro Pro Arg Gly Gly Met Ala Gln Lys Leu Gly Ser Gly Arg Gly Thr
290 295 300

Gly Gln Met Glu Gly Arg Phe Thr Gly Gln Arg Arg
305 310 315



<210> 137

<211> 314

<212> PRT

<213> Homo sapien

<220>

<221> MISC_FEATURE

<222> (4)..(4)

<223> X=any amino acid

<400> 137



Leu Gly Gly Xaa Glu Ser Gln Leu Leu Leu Ala Ser Gln Ser Glu Asp
1 5 10 15

Glu Val Glu Glu Glu Gln Glu Glu Arg Gln Pro Ser Pro Glu Pro Val
20 25 30

Gln Glu Asn Ala Asn Ser Gly Tyr Tyr Glu Ala His Pro Val Thr Asn
35 40 45

Gly Ile Glu Glu Pro Leu Glu Glu Ser Ser His Glu Pro Glu Pro Glu
50 55 60

Pro Glu Ser Glu Thr Lys Thr Glu Glu Leu Lys Pro Gln Val Glu Glu
65 70 75 80

Lys Asn Leu Glu Glu Leu Glu Glu Lys Ser Thr Thr Pro Pro Pro Ala
85 90 95

Glu Pro Val Ser Leu Pro Gln Glu Pro Pro Lys Pro Arg Val Glu Ala
100 105 110

Lys Pro Glu Val Gln Ser Gln Pro Pro Arg Val Arg Glu Gln Arg Pro
115 120 125

Arg Glu Arg Pro Gly Phe Pro Pro Arg Gly Pro Arg Pro Gly Arg Gly
130 135 140

Asp Met Glu Gln Asn Asp Ser Asp Asn Arg Arg Ile Ile Arg Tyr Pro
145 150 155 160

Asp Ser His Gln Leu Phe Val Gly Asn Leu Pro His Asp Ile Asp Glu
165 170 175

Asn Glu Leu Lys Glu Phe Phe Met Ser Phe Gly Asn Val Val Glu Leu
180 185 190

Arg Ile Asn Thr Lys Gly Val Gly Gly Lys Leu Pro Asn Phe Gly Phe
195 200 205

Val Val Phe Asp Asp Ser Glu Pro Val Gln Arg Ile Leu Ile Ala Lys
210 215 220

Pro Ile Met Phe Arg Gly Glu Val Arg Leu Asn Val Glu Glu Lys Lys
225 230 235 240

Thr Arg Ala Ala Arg Glu Arg Glu Thr Arg Gly Gly Gly Asp Asp Arg
245 250 255

Arg Asp Ile Arg Arg Asn Asp Arg Gly Pro Gly Gly Pro Arg Gly Ile
260 265 270

Val Gly Gly Gly Met Met Arg Asp Arg Asp Gly Arg Gly Pro Pro Pro
275 280 285

Arg Gly Gly Met Ala Gln Lys Leu Gly Ser Gly Arg Gly Thr Gly Gln
290 295 300

Met Glu Gly Arg Phe Thr Gly Gln Arg Arg
305 310



<210> 138 
<211> 169 
<212> PRT 
<213> Homo sapien
<400> 138

Met Glu Gln Asn Asp Ser Asp Asn Arg Arg Ile Ile Arg Tyr Pro Asp
1 5 10 15

Ser His Gln Leu Phe Val Gly Asn Leu Pro His Asp Ile Asp Glu Asn
20 25 30

Glu Leu Lys Glu Phe Phe Met Ser Phe Gly Asn Val Val Glu Leu Arg
35 40 45

Ile Asn Thr Lys Gly Val Gly Gly Lys Leu Pro Asn Phe Gly Phe Val
50 55 60

Val Phe Asp Asp Ser Glu Pro Val Gln Arg Ile Leu Ile Ala Lys Pro
65 70 75 80

Ile Met Phe Arg Gly Glu Val Arg Leu Asn Val Glu Glu Lys Lys Thr
85 90 95

Arg Ala Ala Arg Glu Arg Glu Thr Arg Gly Gly Gly Asp Asp Arg Arg
100 105 110

Asp Ile Arg Arg Asn Asp Arg Gly Pro Gly Gly Pro Arg Gly Ile Val
115 120 125

Gly Gly Gly Met Met Arg Asp Arg Asp Gly Arg Gly Pro Pro Pro Arg
130 135 140

Gly Gly Met Ala Gln Lys Leu Gly Ser Gly Arg Gly Thr Gly Gln Met
145 150 155 160

Glu Gly Arg Phe Thr Gly Gln Arg Arg
165



<210> 139 

<211> 147 

<212> PRT 

<213> Homo sapien 

<400> 139 

Met Gly Arg Val Arg Thr Lys Thr Val Lys Lys Ala Ala Arg Val Ile
1 5 10 15

Ile Glu Lys Tyr Tyr Thr Arg Leu Gly Asn Asp Phe His Thr Asn Lys
20 25 30

Arg Val Cys Glu Glu Ile Ala Ile Ile Pro Ser Lys Lys Leu Arg Asn
35 40 45

Lys Ile Ala Gly Tyr Val Thr His Leu Met Lys Arg Ile Gln Arg Gly
50 55 60

Pro Val Arg Gly Ile Ser Ile Lys Leu Gln Glu Glu Glu Arg Glu Arg
65 70 75 80

Arg Asp Asn Tyr Val Pro Glu Val Ser Ala Leu Asp Gln Glu Ile Ile
85 90 95

Glu Val Asp Pro Asp Thr Lys Glu Met Leu Lys Leu Leu Asp Phe Gly
100 105 110

Ser Leu Ser Asn Leu Gln Val Ile His Pro Asn Cys Arg Leu Ser Asp
115 120 125

Leu Lys Val Gly Gln Thr Ala Val Gly Met Asn Phe Lys Thr Pro Arg
130 135 140

Gly Pro Val
145



<210> 140 

<211> 166 

<212> PRT 

<213> Homo sapien 

<220>

<221> MISC_FEATURE

<222> (129)..(129)

<223> X=any amino acid 



<220> 

<221> MISC_FEATURE 

<222> (134)..(134)

<223> X=any amino acid 



<400> 140 

Ala Leu Thr Gly Phe Ala Cys Ala Ser Cys Phe Leu Phe Tyr Gln Gly
1 5 10 15

Pro Ala Asn Met Gly Arg Val Arg Thr Lys Thr Val Lys Lys Ala Ala
20 25 30

Arg Val Ile Ile Glu Lys Tyr Tyr Thr Arg Leu Gly Asn Asp Phe His
35 40 45

Thr Asn Lys Arg Val Cys Glu Glu Ile Ala Ile Ile Pro Ser Lys Lys
50 55 60

Leu Arg Asn Lys Ile Ala Gly Tyr Val Thr His Leu Met Lys Arg Ile
65 70 75 80

Gln Arg Gly Pro Val Arg Gly Ile Ser Ile Lys Leu Gln Glu Glu Glu
85 90 95

Arg Glu Arg Arg Asp Asn Tyr Val Pro Glu Val Ser Ala Leu Asp Gln
100 105 110

Glu Ile Ile Glu Val Asp Pro Asp Thr Lys Glu Met Leu Lys Leu Leu
115 120 125

Xaa Phe Gly Ser Leu Xaa Asn Leu Gln Val Ile His Pro Asn Cys Arg
130 135 140

Leu Ser Asp Leu Lys Val Gly Gln Thr Ala Val Gly Met Asn Phe Lys
145 150 155 160

Thr Pro Arg Gly Pro Val
165



<210> 141 

<211> 254 

<212> PRT 

<213> Homo sapien 

<400> 141 

Met Ser Val Asn Ala Ile Arg Lys Gln Ser Thr Asp Glu Glu Val Thr
1 5 10 15

Ser Leu Ala Lys Ser Leu Ile Lys Ser Trp Lys Lys Leu Leu Asp Gly
20 25 30

Pro Ser Thr Glu Lys Asp Leu Asp Glu Lys Lys Lys Glu Pro Ala Ile
35 40 45

Thr Ser Gln Asn Ser Pro Glu Ala Arg Glu Glu Ser Thr Ser Ser Gly
50 55 60

Asn Val Ser Asn Arg Lys Asp Glu Thr Asn Ala Arg Asp Thr Tyr Val
65 70 75 80

Ser Ser Phe Pro Arg Ala Pro Ser Thr Ser Asp Ser Val Arg Leu Lys
85 90 95

Cys Arg Glu Met Leu Ala Ala Ala Leu Arg Thr Gly Asp Asp Tyr Ile
100 105 110

Ala Ile Gly Ala Asp Glu Glu Glu Leu Gly Ser Gln Ile Glu Glu Ala
115 120 125

Ile Tyr Gln Glu Ile Arg Asn Thr Asp Met Lys Tyr Lys Asn Arg Val
130 135 140

Arg Ser Arg Ile Ser Asn Leu Lys Asp Ala Lys Asn Pro Asn Leu Arg
145 150 155 160

Lys Asn Val Leu Cys Gly Asn Ile Pro Pro Asp Leu Phe Ala Arg Met
165 170 175

Thr Ala Glu Glu Met Ala Ser Asp Glu Leu Lys Glu Met Arg Lys Asn
180 185 190

Leu Thr Lys Glu Ala Ile Arg Glu His Gln Met Ala Lys Thr Gly Gly
195 200 205

Thr Gln Thr Asp Leu Phe Thr Cys Gly Lys Cys Lys Lys Lys Asn Cys
210 215 220

Thr Tyr Thr Gln Val Gln Thr Arg Ser Ala Asp Glu Pro Met Thr Thr
225 230 235 240

Phe Val Val Cys Asn Glu Cys Gly Asn Arg Trp Lys Phe Cys
245 250



<210> 142 

<211> 302 

<212> PRT 

<213> Homo sapien 

<400> 142 

Arg Gly Leu Asn Val Arg Leu Val Ile Ser Thr Val Leu His Val Cys
1 5 10 15

Leu Ala Ile Lys Asn Ala Ala Gly Ala Leu Asp Leu Leu Lys Glu Leu
20 25 30

Lys Asn Ile Pro Met Thr Leu Glu Leu Leu Gln Ser Thr Arg Ile Gly
35 40 45

Met Ser Val Asn Ala Ile Arg Lys Gln Ser Thr Asp Glu Glu Val Thr
50 55 60

Ser Leu Ala Lys Ser Leu Ile Lys Ser Trp Lys Lys Leu Leu Asp Gly
65 70 75 80

Pro Ser Thr Glu Lys Asp Leu Asp Glu Lys Lys Lys Glu Pro Ala Ile
85 90 95

Thr Ser Gln Asn Ser Pro Glu Ala Arg Glu Glu Ser Thr Ser Ser Gly
100 105 110

Asn Val Ser Asn Arg Lys Asp Glu Thr Asn Ala Arg Asp Thr Tyr Val
115 120 125

Ser Ser Phe Pro Arg Ala Pro Ser Thr Ser Asp Ser Val Arg Leu Lys
130 135 140

Cys Arg Glu Met Leu Ala Ala Ala Leu Arg Thr Gly Asp Asp Tyr Ile
145 150 155 160

Ala Ile Gly Ala Asp Glu Glu Glu Leu Gly Ser Gln Ile Glu Glu Ala
165 170 175

Ile Tyr Gln Glu Ile Arg Asn Thr Asp Met Lys Tyr Lys Asn Arg Val
180 185 190

Arg Ser Arg Ile Ser Asn Leu Lys Asp Ala Lys Asn Pro Asn Leu Arg
195 200 205

Lys Asn Val Leu Cys Gly Asn Ile Pro Pro Asp Leu Phe Ala Arg Met
210 215 220

Thr Ala Glu Glu Met Ala Ser Asp Glu Leu Lys Glu Met Arg Lys Asn
225 230 235 240

Leu Thr Lys Glu Ala Ile Arg Glu His Gln Met Ala Lys Thr Gly Gly
245 250 255

Thr Gln Thr Asp Leu Phe Thr Cys Gly Lys Cys Lys Lys Lys Asn Cys
260 265 270

Thr Tyr Thr Gln Val Gln Thr Arg Ser Ala Asp Glu Pro Met Thr Thr
275 280 285

Phe Val Val Cys Asn Glu Cys Gly Asn Arg Trp Lys Phe Cys
290 295 300



<210> 143 

<211> 225 

<212> PRT 

<213> Homo sapien 

<400> 143 

Met Val Ser His Ser Glu Leu Arg Lys Leu Phe Tyr Ser Ala Asp Ala
1 5 10 15

Val Cys Phe Asp Val Asp Ser Thr Val Ile Arg Glu Glu Gly Ile Asp
20 25 30

Glu Leu Ala Lys Ile Cys Gly Val Glu Asp Ala Val Ser Glu Met Thr
35 40 45

Arg Arg Ala Met Gly Gly Ala Val Pro Phe Lys Ala Ala Leu Thr Glu
50 55 60

Arg Leu Ala Leu Ile Gln Pro Ser Arg Glu Gln Val Gln Arg Leu Ile
65 70 75 80

Ala Glu Gln Pro Pro His Leu Thr Pro Gly Ile Arg Glu Leu Val Ser
85 90 95

Arg Leu Gln Glu Arg Asn Val Gln Val Phe Leu Ile Ser Gly Gly Phe
100 105 110

Arg Ser Ile Val Glu His Val Ala Ser Lys Leu Asn Ile Pro Ala Thr
115 120 125

Asn Val Phe Ala Asn Arg Leu Lys Phe Tyr Phe Asn Gly Glu Tyr Ala
130 135 140

Gly Phe Asp Glu Thr Gln Pro Thr Ala Glu Ser Gly Gly Lys Gly Lys
145 150 155 160

Val Ile Lys Leu Leu Lys Glu Lys Phe His Phe Lys Lys Ile Ile Met
165 170 175

Ile Gly Asp Gly Ala Thr Asp Met Glu Ala Cys Pro Pro Ala Asp Ala
180 185 190

Phe Ile Gly Phe Gly Gly Asn Val Ile Arg Gln Gln Val Lys Asp Asn
195 200 205

Ala Lys Trp Tyr Ile Thr Asp Phe Val Glu Leu Leu Gly Glu Leu Glu
210 215 220

Glu
225



<210> 144 

<211> 249 

<212> PRT 

<213> Homo sapien 

<400> 144 

Met Lys Gln Thr Asn Ile Gln Lys Asn Thr Asn Thr Arg Asp Thr Ser
1 5 10 15

Lys Lys Thr Lys Asp Gln Leu Ile Ile Asp Ala Gly Gln Lys His Phe
20 25 30

Gly Ala Thr Val Cys Lys Ser Cys Gly Met Ile Tyr Thr Ala Ser Asn
35 40 45

Pro Glu Asp Glu Met Gln His Val Gln His His His Arg Phe Leu Glu
50 55 60

Gly Ile Lys Tyr Val Gly Trp Lys Lys Glu Arg Val Val Ala Glu Phe
65 70 75 80

Trp Asp Gly Lys Ile Val Leu Val Leu Pro His Asp Pro Ser Phe Ala
85 90 95

Ile Lys Lys Val Glu Asp Val Gln Glu Leu Val Asp Asn Glu Leu Gly
100 105 110

Phe Gln Gln Val Val Pro Lys Cys Pro Asn Lys Ile Lys Thr Phe Leu
115 120 125

Phe Ile Ser Asp Glu Lys Arg Val Val Gly Cys Leu Ile Ala Glu Pro
130 135 140

Ile Lys Gln Ala Phe Arg Val Leu Ser Glu Pro Ile Gly Pro Glu Ser
145 150 155 160

Pro Ser Ser Thr Glu Cys Pro Arg Ala Trp Gln Cys Ser Asp Val Pro
165 170 175

Glu Pro Ala Val Cys Gly Ile Ser Arg Ile Trp Val Phe Arg Leu Lys
180 185 190

Arg Arg Lys Arg Ile Ala Arg Arg Leu Val Asp Thr Leu Arg Asn Cys
195 200 205

Phe Met Phe Gly Cys Phe Leu Ser Thr Asp Glu Ile Ala Phe Ser Asp
210 215 220

Pro Thr Pro Asp Gly Lys Leu Phe Ala Thr Lys Tyr Cys Asn Thr Pro
225 230 235 240

Asn Phe Leu Val Tyr Asn Phe Asn Ser
245



<210> 145 

<211> 113 

<212> PRT 

<213> Homo sapien 

<400> 145 

Met Lys Ser Phe Ser Lys Ser Ser Asn Lys Cys Thr Leu Asn Thr Ser
1 5 10 15

Thr Val Arg Glu Phe Leu Ser Phe Arg Met Asn Ala Ile His Thr Lys
20 25 30

Glu Leu Leu Leu Thr Ser His Leu Gln Ser Pro Pro Gly His Arg Gln
35 40 45

Asp Pro Phe Asn Lys Ser Ser Ser Glu Thr Pro Ile Val Gln Asn Leu
50 55 60

Gln Leu Ala Thr Gly Tyr His His Ser Leu Trp Leu Cys Lys Ile Lys
65 70 75 80

Asp Leu Glu Glu Gly Trp Gly Gly Gly Ser Tyr Glu Lys Arg Gln Glu
85 90 95

Lys Ser Ser Phe Asp Pro Met Leu Ser Glu Ser Val His Glu Glu Glu
100 105 110

Ser

<210> 146
<211> 102 
<212> PRT 
<213> Homo sapien 

<400> 146 

Met Val Thr Glu Glu Lys Arg Ser Glu Ala Arg Glu Asn Glu Arg Ser
1 5 10 15

Leu Ala Phe Val Lys Met Val Gly His His Val Ala Phe Leu Glu Ala
20 25 30

Asp Val Leu Gln Ala Glu Arg Asp His Gly Ala Phe Pro Gln Ala Leu
35 40 45

Arg Arg Trp Leu Gly Ser Ala Gly Leu Pro Ser Phe Arg Asn Lys Ser
50 55 60

Pro Ala Pro Val Pro Val Thr Tyr Glu Leu Pro Thr Leu Tyr Arg Thr
65 70 75 80

Glu Asp Tyr Phe Pro Val Asp Ala Gly Glu Ala Gln His His Pro Arg
85 90 95

Thr Cys Pro Arg Pro Leu
100



<210> 147 

<211> 412 

<212> PRT 

<213> Homo sapien 

<400> 147 

Met Thr His Arg Arg Phe Lys Val Thr Ser Thr Val Ala Ala Ala Ser
1 5 10 15

Leu Leu Pro Leu Gln Asp Glu Lys Glu Val Leu Leu Cys Lys Pro Ala
20 25 30

Trp Leu Ser Pro Ser Gly Thr Arg Thr Gly Gly Phe Leu Ala Val Pro
35 40 45

Gly Pro Pro Leu Arg Ala Lys Gly Pro Pro Val Leu Trp Pro Pro Pro
50 55 60

Ala His Pro Pro Arg Val Pro Gly Arg Glu His Ser Arg Trp Gly Arg
65 70 75 80

Ser Pro Pro Ala Gln Arg Ala Ala Leu Gly Leu Arg Pro Tyr Leu Leu
85 90 95

Leu Leu Leu Pro Pro Ala Gln Leu Phe Asn Val Tyr Pro Trp Leu Gly
100 105 110

Ala Leu Leu Gln Leu His Arg Pro Val Leu Arg Lys Ile Glu Glu Val
115 120 125

Arg Ala Ile Leu Arg Thr Leu Leu Glu Ala Arg Arg Pro His Val Cys
130 135 140

Pro Gly Asp Pro Val Cys Ser Tyr Val Asp Ala Leu Ile Gln Gln Gly
145 150 155 160

Gln Gly Asp Asp Pro Glu Gly Leu Phe Ala Glu Ala Asn Ala Val Ala
165 170 175

Cys Thr Leu Asp Met Val Met Ala Gly Thr Glu Thr Thr Ser Ala Thr
180 185 190

Leu Gln Trp Ala Ala Leu Leu Met Gly Arg His Pro Asp Val Gln Gly
195 200 205

Glu Thr Pro Ala Pro Gly Glu Thr Ala Pro Ser Ala Pro Gly Gly Pro
210 215 220

Pro Gly Thr Arg Asp Gly Ala Ala Thr Gln Ala Ala Gln Pro Phe Ala
225 230 235 240

Pro Gly Arg Val Gln Glu Glu Leu Asp Arg Val Leu Gly Pro Gly Arg
245 250 255

Thr Pro Arg Leu Glu Asp Gln Gln Ala Leu Pro Tyr Thr Ser Ala Val
260 265 270

Leu His Glu Val Gln Arg Phe Ile Thr Leu Leu Pro His Val Pro Arg
275 280 285

Cys Thr Ala Ala Asp Thr Gln Leu Gly Gly Phe Leu Leu Pro Lys Gly
290 295 300

Thr Pro Val Ile Pro Leu Leu Thr Ser Val Leu Leu Asp Glu Thr Gln
305 310 315 320

Trp Gln Thr Pro Gly Gln Phe Asn Pro Gly His Phe Leu Asp Ala Asn
325 330 335

Gly His Phe Val Lys Arg Glu Ala Phe Leu Pro Phe Ser Ala Gly Arg
340 345 350

Arg Val Cys Val Gly Glu Arg Leu Ala Arg Thr Glu Leu Phe Leu Leu
355 360 365

Phe Ala Gly Leu Leu Gln Arg Tyr Arg Leu Leu Pro Pro Pro Gly Val
370 375 380

Ser Pro Ala Ser Leu Asp Thr Thr Pro Ala Arg Ala Phe Thr Met Arg
385 390 395 400

Pro Arg Ala Gln Ala Leu Cys Ala Val Pro Arg Pro
405 410



<210> 148 

<211> 203 

<212> PRT 

<213> Homo sapien 

<400> 148 

Asp Pro Gly Ala Trp Arg Asp Gly Ser Phe Cys Pro Arg Gly Thr Pro
1 5 10 15

Arg Asp Glu Gly Trp Arg Cys His Pro Ser Gly Pro Pro Phe Ala Pro
20 25 30

Gly Arg Val Gln Glu Glu Leu Asp Arg Val Leu Gly Pro Gly Arg Thr
35 40 45

Pro Arg Leu Glu Asp Gln Gln Ala Leu Pro Tyr Thr Ser Ala Val Leu
50 55 60

His Glu Val Gln Arg Phe Ile Thr Leu Leu Pro His Val Pro Arg Cys
65 70 75 80

Thr Ala Ala Asp Thr Gln Leu Gly Gly Phe Leu Leu Pro Lys Gly Thr
85 90 95

Pro Val Ile Pro Leu Leu Thr Ser Val Leu Leu Asp Glu Thr Gln Trp
100 105 110

Gln Thr Pro Gly Gln Phe Asn Pro Gly His Phe Leu Asp Ala Asn Gly
115 120 125

His Phe Val Lys Arg Glu Ala Phe Leu Pro Phe Ser Ala Gly Arg Arg
130 135 140

Val Cys Val Gly Glu Arg Leu Ala Arg Thr Glu Leu Phe Leu Leu Phe
145 150 155 160

Ala Gly Leu Leu Gln Arg Tyr Arg Leu Leu Pro Pro Pro Gly Val Ser
165 170 175

Pro Ala Ser Leu Asp Thr Thr Pro Ala Arg Ala Phe Thr Met Arg Pro
180 185 190

Arg Ala Gln Ala Leu Cys Ala Val Pro Arg Pro
195 200



<210> 149 

<211> 116 

<212> PRT 

<213> Homo sapien 

<400> 149 

Met Ala Arg Asp Ile Val Ala Met Ser Arg Ala Met Cys Leu Met Leu
1 5 10 15

Leu Ser Val Ala Arg Ala Phe Leu Leu Met Val Val Arg Thr Glu Glu
20 25 30

Val Ala Gly Phe Arg Trp Pro Asp Leu Arg Phe Asn Asp His His Asp
35 40 45

Thr Phe Ala Val Gly Cys Arg Leu His Ala His Ser Leu Ala Val Asn
50 55 60

Gln Ser Val Val Ala Glu Gly Ile Ala Gly Pro Gln Val Ile Gly Leu
65 70 75 80

Ser Ala Val Val Phe Gly Leu Ser Phe Glu Asn Met Glu Asn Trp Ser
85 90 95

Ser Ser Ala Arg Pro Ile Gln Leu Leu Met Pro Glu His Arg Tyr Ala
100 105 110

Asp Ile Arg Gln
115

<210> 150
<211> 141 
<212> PRT 
<213> Homo sapien 

<400> 150 

Gly Glu Arg Pro Leu Ser Trp Ser Pro Leu Gly Arg Gly His Leu Cys
1 5 10 15

Leu Val Pro Leu Gly Gly Arg Arg Gly Ala Cys Ala Gly Lys Ser Arg
20 25 30

Arg Pro Arg Trp Ala Asp His Glu Val Arg Ser Ser Arg Pro Ala Trp
35 40 45

Pro Thr Trp His Thr Trp His Ala His Arg Gly Asp Val Ala Cys His
50 55 60

Val Ser Asp Ala Ala Glu Arg Gly Ala Ser Ile Leu Val Asp Gly Gly
65 70 75 80

Pro His Ile Gly Gly Gly Arg Leu Pro Leu Ala Gly Ser Pro Leu Asn
85 90 95

Asp His His Asp Thr Phe Ala Val Gly Cys Arg Leu His Ala His Ser
100 105 110

Leu Ala Val Asn Gln Ser Val Val Ala Glu Gly Ile Ala Gly Pro Gln
115 120 125

Val Ile Gly Leu Ser Ala Val Val Phe Gly Leu Ser Phe
130 135 140



<210> 151 

<211> 426 

<212> PRT 

<213> Homo sapien 

<400> 151 

Met Ser Pro Ala Pro Asp Ala Ala Pro Ala Pro Ala Ser Ile Ser Leu
1 5 10 15

Phe Asp Leu Ser Ala Asp Ala Pro Val Phe Gln Gly Leu Ser Leu Val
20 25 30

Ser His Ala Pro Gly Glu Ala Leu Ala Arg Ala Pro Arg Thr Ser Cys
35 40 45

Ser Gly Ser Gly Glu Arg Glu Ser Pro Glu Arg Lys Leu Leu Gln Gly
50 55 60

Pro Met Asp Ile Ser Glu Lys Leu Phe Cys Ser Thr Cys Asp Gln Thr
65 70 75 80

Phe Gln Asn His Gln Glu Gln Arg Glu His Tyr Lys Leu Asp Trp His
85 90 95

Arg Phe Asn Leu Lys Gln Arg Leu Lys Asp Lys Pro Leu Leu Ser Ala
100 105 110

Leu Asp Phe Glu Lys Gln Ser Ser Thr Gly Asp Leu Ser Ser Ile Ser
115 120 125

Gly Ser Glu Asp Ser Asp Ser Ala Ser Glu Glu Asp Leu Gln Thr Leu
130 135 140

Asp Arg Glu Arg Ala Thr Phe Glu Lys Leu Ser Arg Pro Pro Gly Phe
145 150 155 160

Tyr Pro His Arg Val Leu Phe Gln Asn Ala Gln Gly Gln Phe Leu Tyr
165 170 175

Ala Tyr Arg Cys Val Leu Gly Pro His Gln Asp Pro Pro Glu Glu Ala
180 185 190

Glu Leu Leu Leu Gln Asn Leu Gln Ser Arg Gly Pro Arg Asp Cys Val
195 200 205

Val Leu Met Ala Ala Ala Gly His Phe Ala Gly Ala Ile Phe Gln Gly
210 215 220

Arg Glu Val Val Thr His Lys Thr Phe His Arg Tyr Thr Val Arg Ala
225 230 235 240

Lys Arg Gly Thr Ala Gln Gly Leu Arg Asp Ala Arg Gly Gly Pro Ser
245 250 255

His Ser Ala Gly Ala Asn Leu Arg Arg Tyr Asn Glu Ala Thr Leu Tyr
260 265 270

Lys Asp Val Arg Asp Leu Leu Ala Gly Pro Ser Trp Ala Lys Ala Leu
275 280 285

Glu Glu Ala Gly Thr Ile Leu Leu Arg Ala Pro Arg Ser Gly Arg Ser
290 295 300

Leu Phe Phe Gly Gly Lys Gly Ala Pro Leu Gln Arg Gly Asp Pro Arg
305 310 315 320

Leu Trp Asp Ile Pro Leu Ala Thr Arg Arg Pro Thr Phe Gln Glu Leu
325 330 335

Gln Arg Val Leu His Lys Leu Thr Thr Leu His Val Tyr Glu Glu Asp
340 345 350

Pro Arg Glu Ala Val Arg Leu His Ser Pro Gln Thr His Trp Lys Thr
355 360 365

Val Arg Glu Glu Arg Lys Lys Pro Thr Glu Glu Glu Ile Arg Lys Ile
370 375 380

Cys Arg Asp Glu Lys Glu Ala Leu Gly Gln Asn Glu Glu Ser Pro Lys
385 390 395 400

Gln Gly Leu Ile Thr Ile Trp Gln Leu Ser Asp Leu Ser Phe Cys Pro
405 410 415

Lys Asn Ala Leu Ala Asn Ser Leu Leu Ser
420 425



<210> 152 

<211> 370 

<212> PRT 

<213> Homo sapien 

<400> 152 

Met Ser Pro Ala Pro Asp Ala Ala Pro Ala Pro Ala Ser Ile Ser Leu
1 5 10 15

Phe Asp Leu Ser Ala Asp Ala Pro Val Phe Gln Gly Leu Ser Leu Arg
20 25 30

Glu His Tyr Lys Leu Asp Trp His Arg Phe Asn Leu Lys Gln Arg Leu
35 40 45

Lys Asp Lys Pro Leu Leu Ser Ala Leu Asp Phe Glu Lys Gln Ser Ser
50 55 60

Thr Gly Asp Leu Ser Ser Ile Ser Gly Ser Glu Asp Ser Asp Ser Ala
65 70 75 80

Ser Glu Glu Asp Leu Gln Thr Leu Asp Arg Glu Arg Ala Thr Phe Glu
85 90 95

Lys Leu Ser Arg Pro Pro Gly Phe Tyr Pro His Arg Val Leu Phe Gln
100 105 110

Asn Ala Gln Gly Gln Phe Leu Tyr Ala Tyr Arg Cys Val Leu Gly Pro
115 120 125

His Gln Asp Pro Pro Glu Glu Ala Glu Leu Leu Leu Gln Asn Leu Gln
130 135 140

Ser Arg Gly Pro Arg Asp Cys Val Val Leu Met Ala Ala Ala Gly His
145 150 155 160

Phe Ala Gly Ala Ile Phe Gln Gly Arg Glu Val Val Thr His Lys Thr
165 170 175

Phe His Arg Tyr Thr Val Arg Ala Lys Arg Gly Thr Ala Gln Gly Leu
180 185 190

Arg Asp Ala Arg Gly Gly Pro Ser His Ser Ala Gly Ala Asn Leu Arg
195 200 205

Arg Tyr Asn Glu Ala Thr Leu Tyr Lys Asp Val Arg Asp Leu Leu Ala
210 215 220

Gly Pro Ser Trp Ala Lys Ala Leu Glu Glu Ala Gly Thr Ile Leu Leu
225 230 235 240

Arg Ala Pro Arg Ser Gly Arg Ser Leu Phe Phe Gly Gly Lys Gly Ala
245 250 255

Pro Leu Gln Arg Gly Asp Pro Arg Leu Trp Asp Ile Pro Leu Ala Thr
260 265 270

Arg Arg Pro Thr Phe Gln Glu Leu Gln Arg Val Leu His Lys Leu Thr
275 280 285

Thr Leu His Val Tyr Glu Glu Asp Pro Arg Glu Ala Val Arg Leu His
290 295 300

Ser Pro Gln Thr His Trp Lys Thr Val Arg Glu Glu Arg Lys Lys Pro
305 310 315 320




Thr Glu Glu Glu Ile Arg Lys Ile Cys Arg Asp Glu Lys Glu Ala Leu
325 330 335

Gly Gln Asn Glu Glu Ser Pro Lys Gln Gly Leu Ile Thr Ile Trp Gln
340 345 350

Leu Ser Asp Leu Ser Phe Cys Pro Lys Asn Ala Leu Ala Asn Ser Leu
355 360 365

Leu Ser
370



<210> 153 

<211> 208 

<212> PRT 

<213> Homo sapien 

<400> 153 

Met Ser Pro Ala Pro Asp Ala Ala Pro Ala Pro Ala Ser Ile Ser Leu
1 5 10 15

Phe Asp Leu Ser Ala Asp Ala Pro Val Phe Gln Gly Leu Ser Leu Val
20 25 30

Ser His Ala Pro Gly Glu Ala Leu Ala Arg Ala Pro Arg Thr Ser Cys
35 40 45

Ser Gly Ser Gly Glu Arg Glu Ser Pro Glu Arg Lys Leu Leu Gln Gly
50 55 60

Pro Met Asp Ile Ser Glu Lys Leu Phe Cys Ser Thr Cys Asp Gln Thr
65 70 75 80

Phe Gln Asn His Gln Glu Gln Arg Glu His Tyr Lys Leu Asp Trp His
85 90 95

Arg Phe Asn Leu Lys Gln Arg Leu Lys Asp Lys Pro Leu Leu Ser Ala
100 105 110

Leu Asp Phe Glu Lys Gln Ser Ser Thr Gly Asp Leu Ser Ser Ile Ser
115 120 125

Gly Ser Glu Asp Ser Asp Ser Ala Ser Glu Glu Asp Leu Gln Thr Leu
130 135 140

Asp Arg Glu Arg Ala Thr Phe Glu Lys Leu Ser Arg Pro Pro Gly Phe
145 150 155 160

Tyr Pro His Arg Val Leu Phe Gln Asn Ala Gln Gly Gln Phe Leu Tyr
165 170 175

Ala Tyr Arg Cys Val Leu Gly Pro His Gln Arg Gln Val Thr Val Gln
180 185 190

Val Ala Trp Leu Thr Pro Ala Phe Cys Thr Pro Ser Leu Asp Phe Pro
195 200 205



<210> 154 

<211> 209 

<212> PRT 

<213> Homo sapien 

<400> 154 

Trp Thr Gln Leu Leu Met Cys Tyr Phe Tyr Leu Gly Asp Lys Ile Lys
1 5 10 15

Thr Ile Ser Phe Gln Ala Phe Ile Leu Met His Leu Leu Leu Pro Ser
20 25 30

Glu Tyr Ser Leu Asp Gly Phe His Met Ser Gly Phe Ser Leu Gly Ser
35 40 45

Gly Ser Glu Gly Glu Asp Gly Phe Gln Val Glu Leu Glu Leu Val Glu
50 55 60

Leu Thr Val Gly Thr Leu Asp Leu Cys Glu Ser Glu Val Leu Pro Lys
65 70 75 80

Arg Arg Arg Arg Lys Arg Asn Lys Lys Glu Lys Ser Arg Asp Gln Glu
85 90 95

Ala Gly Ala His Arg Thr Leu Leu Gln Gln Thr Gln Glu Glu Glu Pro
100 105 110

Ser Thr Gln Ser Ser Gln Ala Val Ala Ala Pro Leu Gly Pro Leu Leu
115 120 125

Asp Glu Ala Lys Ala Pro Gly Gln Pro Glu Leu Trp Asn Ala Leu Leu
130 135 140

Ala Ala Cys Arg Ala Gly Asp Val Gly Val Leu Lys Leu Gln Leu Ala
145 150 155 160

Pro Ser Pro Ala Asp Pro Arg Val Leu Ser Leu Leu Ser Ala Pro Leu
165 170 175

Gly Ser Gly Gly Phe Thr Leu Leu His Ala Ala Ala Ala Ala Gly Arg
180 185 190

Gly Ser Val Val Arg Leu Leu Leu Glu Ala Gly Ala Asp Pro Thr Val
195 200 205

Gln



<210> 155 

<211> 125 

<212> PRT 

<213> Homo sapien 

<400> 155 

Met Ser Pro Ala Pro Asp Ala Ala Pro Ala Pro Ala Ser Ile Ser Leu
1 5 10 15

Phe Asp Leu Ser Ala Asp Ala Pro Val Phe Gln Gly Leu Ser Leu Val
20 25 30

Ser His Ala Pro Gly Glu Ala Leu Ala Arg Ala Pro Arg Thr Ser Cys
35 40 45

Ser Gly Ser Gly Glu Arg Glu Ser Pro Glu Arg Lys Leu Leu Gln Gly
50 55 60

Pro Met Asp Ile Ser Glu Lys Leu Phe Cys Ser Thr Cys Asp Gln Thr
65 70 75 80

Phe Gln Asn His Gln Glu Gln Arg Glu His Tyr Lys Leu Asp Trp His
85 90 95

Arg Phe Asn Leu Lys Gln Arg Leu Lys Asp Lys Pro Leu Leu Ser Ala
100 105 110

Leu Asp Phe Glu Lys Gln Ser Ser Thr Gly Asp Glu Trp
115 120 125



<210> 156 

<211> 191 

<212> PRT 

<213> Homo sapien 
<400> 156

Glu Pro Ser Leu Asp Arg Pro Gly Asp Asp Gln Leu Val Leu Gly Gly
1 5 10 15

Gly Leu Cys Arg Val Glu Gly Ser Gln Val Pro Val Pro Ala Leu Ser
20 25 30

Pro Ala Thr Ala Pro Thr Ser Phe Glu Gly Pro Phe Gly Lys Ile Val
35 40 45

His Gln Val Arg Ala Ala Ile His Thr Pro Arg Phe Ser Lys Asp His
50 55 60

Lys Cys Ser Leu Val Phe Tyr Ile Leu Ser Pro His Phe Leu Asp Pro
65 70 75 80

Val Phe Leu Ser Thr Lys Ser His Ser Gln Arg Gln Pro Leu Leu Ala
85 90 95

Thr Leu Ser Ser Val Pro Gly Ala Pro Glu Pro Cys Pro Gln Asp Gly
100 105 110

Ser Pro Ala Ser His Pro Leu His Pro Pro Leu Cys Ile Ser Thr Gly
115 120 125

Ala Thr Val Pro Tyr Phe Ala Glu Gly Ser Gly Gly Pro Val Pro Thr
130 135 140

Thr Ser Thr Leu Ile Leu Pro Pro Glu Tyr Ser Ser Trp Gly Tyr Pro
145 150 155 160

Tyr Gly Glu Ser Thr Ala Arg Ala Trp Gln Gly Gly Asp Ala Lys Ser
165 170 175

Pro Thr Gln Thr Leu Leu Ser Ser Arg Arg Gly Pro Thr Val Leu
180 185 190



<210> 157 

<211> 130 

<212> PRT 

<213> Homo sapien 

<400> 157 



Met Gly Cys Leu Leu Thr Gly Leu Pro Arg Thr Leu Pro Arg Trp Cys
1 5 10 15

Cys Leu Ala Pro Gly Arg Ile Pro Val Leu Ala Ala Ser Arg Gly Leu
20 25 30

Gly Arg Arg Leu Ala Gly Ala His Ala Ala Ile Pro Phe Ala Ala Ile
35 40 45

Arg Val Thr Cys Ile Gly Ser Cys Gly Val Ser Asn Lys Ala Asn Asp
50 55 60

Thr Ala Trp Val Val Glu Glu Gly Tyr Phe Asn Ser Ser Leu Ser Leu
65 70 75 80

Ala Asp Lys Gly Lys Phe Gly Ser Gln Phe Pro Ser Gly Asp Pro Trp
85 90 95

Gly Gln Pro Leu Glu Trp Gly Leu Ser Val Leu Ser Ser Pro Phe Pro
100 105 110

Arg Ser Ala Ser Trp Ile Trp His Trp Pro Ile Leu Ser Gln Gly Cys
115 120 125

Gly Pro
130



<210> 158 

<211> 340 

<212> PRT 

<213> Homo sapien 

<400> 158 

Pro Gly Glu Ala His Phe Arg Glu Asp His Trp Pro Ala Ala Gly Pro 
1 5 10 15 

Thr Arg Arg Arg Ser Ser Arg Pro Gly Val Pro Leu Gln Gly Ala Glu 
20 25 30 

Glu Asp Gly Ala Leu Trp Lys Gly Ala Arg Gly Phe Asn Gly Val Gln 
35 40 45 

Leu Phe Glu Gly Met Lys Ala Phe Lys Gly Lys Asp Gln Gln Val Arg 
50 55 60 

Leu Phe Arg Pro Trp Leu Asn Met Asp Arg Met Leu Arg Ser Ala Met 
65 70 75 80 

Arg Leu Cys Leu Pro Ser Phe Asp Lys Leu Glu Leu Leu Glu Cys Ile 
85 90 95 

Arg Arg Leu Ile Glu Val Asp Lys Asp Trp Val Pro Asp Ala Ala Gly 
100 105 110 

Thr Ser Leu Tyr Val Arg Pro Val Leu Ile Gly Asn Glu Pro Ser Leu 
115 120 125 

Gly Val Ser Gln Pro Thr Arg Ala Leu Leu Phe Val Ile Leu Cys Pro 
130 135 140 

Val Gly Ala Tyr Phe Pro Gly Gly Ser Val Thr Pro Val Ser Leu Leu 
145 150 155 160 

Ala Asp Pro Ala Phe Ile Arg Ala Trp Val Gly Gly Val Gly Asn Tyr 
165 170 175 

Lys Leu Gly Gly Asn Tyr Gly Pro Thr Val Leu Val Gln Gln Glu Ala 
180 185 190 

Leu Lys Arg Gly Cys Glu Gln Val Leu Trp Leu Tyr Gly Pro Asp His 
195 200 205 

Gln Leu Thr Glu Val Gly Thr Met Asn Ile Phe Val Tyr Trp Thr His 
210 215 220 

Glu Asp Gly Val Leu Glu Leu Val Thr Pro Pro Leu Asn Gly Val Ile 
225 230 235 240 

Leu Pro Gly Val Val Arg Gln Ser Leu Leu Asp Met Ala Gln Thr Trp 
245 250 255 

Gly Glu Phe Arg Val Val Glu Arg Thr Ile Thr Met Lys Gln Leu Leu 
260 265 270 

Arg Ala Leu Glu Glu Gly Arg Val Arg Glu Val Phe Gly Ser Gly Thr 
275 280 285 

Ala Cys Gln Val Cys Pro Val His Arg Ile Leu Tyr Lys Asp Arg Asn 
290 295 300 

Leu His Ile Pro Thr Met Glu Asn Gly Pro Glu Leu Ile Leu Arg Phe 
305 310 315 320 

Gln Lys Glu Leu Lys Glu Ile Gln Tyr Gly Ile Arg Ala His Glu Trp 
325 330 335 

Met Phe Pro Val 
340 



<210> 159 

<211> 306 

<212> PRT 

<213> Homo sapien 



<400> 159 

Met Ala Pro Phe Gly Lys Glu His Glu Ala Leu Met Gly Glu Leu Phe 
1 5 10 15 

Glu Gly Met Lys Ala Phe Lys Gly Lys Asp Gln Gln Val Arg Leu Phe 
20 25 30 

Arg Pro Trp Leu Asn Met Asp Arg Met Leu Arg Ser Ala Met Arg Leu 
35 40 45 

Cys Leu Pro Ser Phe Asp Lys Leu Glu Leu Leu Glu Cys Ile Arg Arg 
50 55 60 

Leu Ile Glu Val Asp Lys Asp Trp Val Pro Asp Ala Ala Gly Thr Ser 
65 70 75 80 

Leu Tyr Val Arg Pro Val Leu Ile Gly Asn Glu Pro Ser Leu Gly Val 
85 90 95 

Ser Gln Pro Thr Arg Ala Leu Leu Phe Val Ile Leu Cys Pro Val Gly 
100 105 110 

Ala Tyr Phe Pro Gly Gly Ser Val Thr Pro Val Ser Leu Leu Ala Asp 
115 120 125 

Pro Ala Phe Ile Arg Ala Trp Val Gly Gly Val Gly Asn Tyr Lys Leu 
130 135 140 

Gly Gly Asn Tyr Gly Pro Thr Val Leu Val Gln Gln Glu Ala Leu Lys 
145 150 155 160 

Arg Gly Cys Glu Gln Val Leu Trp Leu Tyr Gly Pro Asp His Gln Leu 
165 170 175 

Thr Glu Val Gly Thr Met Asn Ile Phe Val Tyr Trp Thr His Glu Asp 
180 185 190 

Gly Val Leu Glu Leu Val Thr Pro Pro Leu Asn Gly Val Ile Leu Pro 
195 200 205 

Gly Val Val Arg Gln Ser Leu Leu Asp Met Ala Gln Thr Trp Gly Glu 
210 215 220 

Phe Arg Val Val Glu Arg Thr Ile Thr Met Lys Gln Leu Leu Arg Ala 
225 230 235 240 

Leu Glu Glu Gly Arg Val Arg Glu Val Phe Gly Ser Gly Thr Ala Cys 
245 250 255 

Gln Val Cys Pro Val His Arg Ile Leu Tyr Lys Asp Arg Asn Leu His 
260 265 270 

Ile Pro Thr Met Glu Asn Gly Pro Glu Leu Ile Leu Arg Phe Gln Lys 
275 280 285 

Glu Leu Lys Glu Ile Gln Tyr Gly Ile Arg Ala His Glu Trp Met Phe 
290 295 300 

Pro Val 
305 



<210> 160 

<211> 485 

<212> PRT 

<213> Homo sapien 

<400> 160 

Gln Ile Val Tyr Leu Tyr Ile Gln Arg Ile Ile Arg Val Phe His Gly 
1 5 10 15 

Val Asn Ala Pro Asp Asn Pro Leu Lys Glu Glu His Leu Val Gln Leu 
20 25 30 

Asn Glu Thr Asp Ile Leu Arg Val Leu Asp Gly Asn Thr Gly Gly Thr 
35 40 45 

Tyr Gly Gly His Ile Pro Gly Ser Asp Arg Ala Gly Leu Asn Arg His 
50 55 60 

Asp Lys Ser Glu Asn Pro Gly Arg Ile Tyr Ala Gly Gly Ser Ser Thr 
65 70 75 80 

Ala Ala Gly Asp Pro Ser Leu Leu Ser Ala Ala Arg Ile Met Ala Ala 
85 90 95 

Ala Ala Leu Gly Gln Ile Trp Ala Arg Lys Leu Leu Ser Val Pro Trp 
100 105 110 

Leu Leu Cys Gly Pro Arg Arg Tyr Ala Ser Ser Ser Phe Lys Ala Ala 
115 120 125 

Asp Leu Gln Leu Glu Met Thr Gln Lys Pro His Lys Lys Pro Gly Pro 
130 135 140 

Gly Glu Pro Leu Val Phe Gly Lys Thr Phe Thr Asp His Met Leu Met 
145 150 155 160 

Val Glu Trp Asn Asp Lys Gly Trp Gly Gln Pro Arg Ile Gln Pro Phe 
165 170 175 

Gln Asn Leu Thr Leu His Pro Ala Ser Ser Ser Leu His Tyr Ser Leu 
180 185 190 

Gln Leu Phe Glu Gly Met Lys Ala Phe Lys Gly Lys Asp Gln Gln Val 
195 200 205 

Arg Leu Phe Arg Pro Trp Leu Asn Met Asp Arg Met Leu Arg Ser Ala 
210 215 220 

Met Arg Leu Cys Leu Pro Ser Phe Asp Lys Leu Glu Leu Leu Glu Cys 
225 230 235 240 

Ile Arg Arg Leu Ile Glu Val Asp Lys Asp Trp Val Pro Asp Ala Ala 
245 250 255 

Gly Thr Ser Leu Tyr Val Arg Pro Val Leu Ile Gly Asn Glu Pro Ser 
260 265 270 

Leu Gly Val Ser Gln Pro Thr Arg Ala Leu Leu Phe Val Ile Leu Cys 
275 280 285 

Pro Val Gly Ala Tyr Phe Pro Gly Gly Ser Val Thr Pro Val Ser Leu 
290 295 300 

Leu Ala Asp Pro Ala Phe Ile Arg Ala Trp Val Gly Gly Val Gly Asn 
305 310 315 320 

Tyr Lys Leu Gly Gly Asn Tyr Gly Pro Thr Val Leu Val Gln Gln Glu 
325 330 335 

Ala Leu Lys Arg Gly Cys Glu Gln Val Leu Trp Leu Tyr Gly Pro Asp 
340 345 350 

His Gln Leu Thr Glu Val Gly Thr Met Asn Ile Phe Val Tyr Trp Thr 
355 360 365 

His Glu Asp Gly Val Leu Glu Leu Val Thr Pro Pro Leu Asn Gly Val 
370 375 380 

Ile Leu Pro Gly Val Val Arg Gln Ser Leu Leu Asp Met Ala Gln Thr 
385 390 395 400 

Trp Val Arg Thr Trp His Leu Leu Val Met Gly Ala Met Cys Gln Gly 
405 410 415 

Pro Gly His Gln Arg Ala Gly Thr Gly Ala His Trp His Val Ser Ala 
420 425 430 

Pro Ser Pro Gly Ser Val Ser Pro Val Gly Pro Leu Ser Phe Ser Leu 
435 440 445 

Ser Ser Gly Arg Glu Arg Trp Arg Ser Ala Ala Gly Gln Pro Ser Gly 
450 455 460 

Asp Thr Cys Leu Cys Gln Leu Pro Cys Arg Val Ser Ser Gly Trp Trp 
465 470 475 480 

Ser Ala Arg Ser Pro 
485 



<210> 161 

<211> 465 

<212> PRT 

<213> Homo sapien 

<400> 161 

Met Ala Ala Ala Ala Leu Gly Gln Ile Trp Ala Arg Lys Leu Leu Ser 
1 5 10 15 

Val Pro Trp Leu Leu Cys Gly Pro Arg Arg Tyr Ala Ser Ser Ser Phe 
20 25 30 

Lys Ala Ala Asp Leu Gln Leu Glu Met Thr Gln Lys Pro His Lys Lys 
35 40 45 

Pro Gly Pro Gly Glu Pro Leu Val Phe Gly Lys Thr Phe Thr Asp His 
50 55 60 

Met Leu Met Val Glu Trp Asn Asp Lys Gly Trp Gly Gln Pro Arg Ile 
65 70 75 80 

Gln Pro Phe Gln Asn Leu Thr Leu His Pro Ala Ser Ser Ser Leu His 
85 90 95 

Tyr Ser Leu Gln Leu Phe Glu Gly Met Lys Ala Phe Lys Gly Lys Asp 
100 105 110 

Gln Gln Val Arg Leu Phe Arg Pro Trp Leu Asn Met Asp Arg Met Leu 
115 120 125 

Arg Ser Ala Met Arg Leu Cys Leu Pro Ser Phe Asp Lys Leu Glu Leu 
130 135 140 

Leu Glu Cys Ile Arg Arg Leu Ile Glu Val Asp Lys Asp Trp Val Pro 
145 150 155 160 

Asp Ala Ala Gly Thr Ser Leu Tyr Val Arg Pro Val Leu Ile Gly Asn 
165 170 175 

Glu Pro Ser Leu Gly Val Ser Gln Pro Thr Arg Ala Leu Leu Phe Val 
180 185 190 

Ile Leu Cys Pro Val Gly Ala Tyr Phe Pro Gly Gly Ser Val Thr Pro 
195 200 205 

Val Ser Leu Leu Ala Asp Pro Ala Phe Ile Arg Ala Trp Val Gly Gly 
210 215 220 

Val Gly Asn Tyr Lys Leu Gly Gly Asn Tyr Gly Pro Thr Val Leu Val 
225 230 235 240 

Gln Gln Glu Ala Leu Lys Arg Gly Cys Glu Gln Val Leu Trp Leu Tyr 
245 250 255 

Gly Pro Asp His Gln Leu Thr Glu Val Gly Thr Met Asn Ile Phe Val 
260 265 270 

Tyr Trp Thr His Glu Asp Gly Val Leu Glu Leu Val Thr Pro Pro Leu 
275 280 285 

Asn Gly Val Ile Leu Pro Gly Val Val Arg Gln Ser Leu Leu Asp Met 
290 295 300 

Ala Gln Thr Trp Val Glu Asp Met Ala Ser Ser Gly Asp Gly Arg His 
305 310 315 320 

Val Pro Gly Ala Arg Ala Ser Glu Gly Trp Asp Trp Gly Thr Leu Ala 
325 330 335 

Cys Leu Cys Pro Phe Ser Trp Val Cys Leu Ser Arg Trp Ala Ser Val 
340 345 350 

Phe Leu Thr Ile Leu Arg Glu Gly Glu Val Glu Val Cys Ser Arg Ala 
355 360 365 

Thr Leu Trp Gly His Val Ser Leu Pro Thr Ala Leu Gln Gly Glu Phe 
370 375 380 

Arg Val Val Glu Arg Thr Ile Thr Met Lys Gln Leu Leu Arg Ala Leu 
385 390 395 400 

Glu Glu Gly Arg Val Arg Glu Val Phe Gly Ser Gly Thr Ala Cys Gln 
405 410 415 

Val Cys Pro Val His Arg Ile Leu Tyr Lys Asp Arg Asn Leu His Ile 
420 425 430 

Pro Thr Met Glu Asn Gly Pro Glu Leu Ile Leu Arg Phe Gln Lys Glu 
435 440 445 

Leu Lys Glu Ile Gln Tyr Gly Ile Arg Ala His Glu Trp Met Phe Pro 
450 455 460 

Val 
465 



<210> 162 

<211> 74 

<212> PRT 

<213> Homo sapien 

<220> 

<221> MISC_FEATURE 

<222> (3)..(5) 

<223> X=any amino acid 

<220> 

<221> MISC_FEATURE 

<222> (8)..(10) 

<223> X=any amino acid 

<220> 

<221> MISC_FEATURE 

<222> (13)..(14) 

<223> X=any amino acid 



<400> 162 



Thr Leu Xaa Xaa Xaa Phe Asn Xaa Xaa Xaa Ser Ser Xaa Xaa Lys Ile 
1 5 10 15 

Arg Lys Asn Thr Ala Ser Tyr Val Pro Lys Glu Lys Lys Ile Lys Gly 
20 25 30 

Thr Met Pro Thr Cys Ser Thr Ile Lys Ala Ser Phe Ser Tyr Phe Phe 
35 40 45 

Asn Thr Lys Tyr Lys Gln Arg Ile His Ile Leu Lys Thr Glu Leu Arg 
50 55 60 

Ser Arg His Ala Val Leu Glu Thr Leu Gln 
65 70 



<210> 163 

<211> 63 

<212> PRT 

<213> Homo sapien 

<400> 163 

Thr Arg Val Leu Gln Cys Ala Pro Arg Cys Ser Pro Asn Cys Val Ala 
1 5 10 15 

Phe Thr Ala Ala Val Thr Val Pro Ala Cys Ile Tyr Ala Leu Phe Gly 
20 25 30 

Pro Cys Glu His Thr Gly Ile Leu Val Ile Leu Pro Pro Met Glu Tyr 
35 40 45 

Leu Trp Arg Ser Pro Val Phe Ile Tyr Phe Gly Ile Asn Pro Leu 
50 55 60 



<210> 164 

<211> 313 

<212> PRT 

<213> Homo sapien 

<400> 164 



Met Lys Cys Glu His Cys Thr Arg Lys Glu Cys Ser Lys Lys Thr Lys 
1 5 10 15 

Thr Asp Asp Gln Glu Asn Val Ser Ala Asp Ala Pro Ser Pro Ala Gln 
20 25 30 

Glu Asn Gly Glu Lys Gly Glu Phe His Lys Leu Ala Asp Ala Lys Ile 
35 40 45 

Phe Leu Ser Asp Cys Leu Ala Cys Asp Ser Cys Met Thr Ala Glu Glu 
50 55 60 

Gly Val Gln Leu Ser Gln Gln Asn Ala Lys Asp Phe Phe Arg Val Leu 
65 70 75 80 

Asn Leu Asn Lys Lys Cys Asp Thr Ser Lys His Lys Val Leu Val Val 
85 90 95 

Ser Val Cys Pro Gln Ser Leu Pro Tyr Phe Ala Ala Lys Phe Asn Leu 
100 105 110 

Ser Val Thr Asp Ala Ser Arg Arg Leu Cys Gly Phe Leu Lys Ser Leu 
115 120 125 

Gly Val His Tyr Val Phe Asp Thr Thr Ile Ala Ala Asp Phe Ser Ile 
130 135 140 

Leu Glu Ser Gln Lys Glu Phe Val Arg Arg Tyr Arg Gln His Ser Glu 
145 150 155 160 

Glu Glu Arg Thr Leu Pro Met Leu Thr Ser Ala Cys Pro Gly Trp Val 
165 170 175 

Arg Tyr Ala Glu Arg Val Leu Gly Arg Pro Ile Thr Ala His Leu Cys 
180 185 190 

Thr Ala Lys Ser Pro Gln Gln Val Met Gly Ser Leu Val Lys Asp Tyr 
195 200 205 

Phe Ala Arg Gln Gln Asn Leu Ser Pro Glu Lys Ile Phe His Val Ile 
210 215 220 

Val Ala Pro Cys Tyr Asp Lys Lys Leu Glu Ala Leu Gln Glu Ser Leu 
225 230 235 240 

Pro Pro Ala Leu His Gly Ser Arg Gly Ala Asp Cys Val Leu Thr Ser 
245 250 255 

Gly Glu Ile Ala Gln Ile Met Glu Gln Gly Asp Leu Ser Val Arg Asp 
260 265 270 

Ala Ala Val Asp Thr Leu Val Ser Gly Phe Ser Gly Glu Ser Pro Leu 
275 280 285 

Gly Gly Arg Thr Ser Arg Gln Pro Cys Gln Pro Pro Ala Arg Pro Arg 
290 295 300 

Ala Ala Leu Leu Tyr Asp Glu Ala Met 
305 310 



<210> 165 

<211> 395 

<212> PRT 

<213> Homo sapien 

<400> 165 

Glu Pro Arg Val Arg Arg Val Ser Asn Ala Glu Leu Ala Asp Arg Ala 
1 5 10 15 

Arg Pro Arg Pro Pro Arg Ala Gln Gly Pro Pro Gly Pro Val Thr Thr 
20 25 30 

Gly Pro Ser Thr Leu Glu Arg Pro Gln Leu Gly Leu Gly Thr Val Arg 
35 40 45 

Ala Leu Thr Asp Ser Leu Val Asn Ala Ala Trp Pro Pro Pro Pro Pro 
50 55 60 

Gln Asp Pro Arg Glu Ala Glu Thr Gly Ala Arg Thr Arg Ser Pro Arg 
65 70 75 80 

Arg Arg Thr Trp Ser Glu Pro Ala Ala Pro Pro Arg Ala Leu Arg Leu 
85 90 95 

Ala Leu Gly Pro Gly Pro Pro Leu Pro Asp Thr Val Ile Gly Leu Gly 
100 105 110 

Lys Ala Val Arg Val Gly Asn Pro Ile Gly Pro Gly Val Arg Leu Val 
115 120 125 

Leu Ser Arg Cys Ser His Trp Pro Ser Ala Ala Val Gly Gly Glu Ala 
130 135 140 

Ala Ser Gly Glu Asp Asn Lys Gly Pro Arg Ala Ala Gly Ser Gly Val 
145 150 155 160 

Pro Val Ser Arg Cys Phe Pro Glu Ala Glu Ala Pro Gly Leu Pro Pro 
165 170 175 

Ala Ala Leu Gln Met Lys Cys Glu His Cys Thr Arg Lys Glu Cys Ser 
180 185 190 

Lys Lys Thr Lys Thr Asp Asp Gln Glu Asn Val Ser Ala Asp Ala Pro 
195 200 205 

Ser Pro Ala Gln Glu Asn Gly Glu Lys Gly Glu Phe His Lys Leu Ala 
210 215 220 

Asp Ala Lys Ile Phe Leu Ser Asp Cys Leu Ala Cys Asp Ser Cys Met 
225 230 235 240 

Thr Ala Glu Glu Gly Val Gln Leu Ser Gln Gln Asn Ala Lys Asp Phe 
245 250 255 

Phe Arg Val Leu Asn Leu Asn Lys Lys Cys Asp Thr Ser Lys His Lys 
260 265 270 

Val Leu Val Val Ser Val Cys Pro Gln Ser Leu Pro Tyr Phe Ala Ala 
275 280 285 

Lys Phe Asn Leu Ser Val Thr Asp Ala Ser Arg Arg Leu Cys Gly Phe 
290 295 300 

Leu Lys Ser Leu Gly Val His Tyr Val Phe Asp Thr Thr Ile Ala Ala 
305 310 315 320 

Asp Phe Ser Ile Leu Glu Ser Gln Lys Glu Phe Val Arg Arg Tyr Arg 
325 330 335 

Gln His Ser Glu Glu Glu Arg Thr Leu Pro Met Leu Thr Ser Ala Cys 
340 345 350 

Pro Gly Trp Val Arg Tyr Ala Glu Arg Val Leu Gly Arg Pro Ile Thr 
355 360 365 

Ala His Leu Cys Thr Ala Lys Ser Pro Gln Gln Val Met Gly Ser Leu 
370 375 380 

Val Lys Asp Tyr Phe Ala Arg Gln Gln Val Ser 
385 390 395 



<210> 166 

<211> 285 

<212> PRT 

<213> Homo sapien 



<400> 166 

Met Gly Ser Ala Phe Pro Val Leu Pro Gly Gly Ser Thr Gly Trp Gly 
1 5 10 15 

Ala Leu Gln Met Phe Gly Arg Thr Thr Pro Ser Pro Glu Gly Gly Ser 
20 25 30 

Arg Gln Thr Trp Met Glu Cys Trp Cys Ser Asn Leu Ser Pro Glu Lys 
35 40 45 

Ile Phe His Val Ile Val Ala Pro Cys Tyr Asp Lys Lys Leu Glu Ala 
50 55 60 

Leu Gln Glu Ser Leu Pro Pro Ala Leu His Gly Ser Arg Gly Ala Asp 
65 70 75 80 

Cys Val Leu Thr Ser Gly Glu Ile Ala Gln Ile Met Glu Gln Gly Asp 
85 90 95 

Leu Ser Val Arg Asp Ala Ala Val Asp Thr Leu Phe Gly Asp Leu Lys 
100 105 110 

Glu Asp Lys Val Thr Arg His Asp Gly Ala Ser Ser Asp Gly His Leu 
115 120 125 

Ala His Ile Phe Arg His Ala Ala Lys Glu Leu Phe Asn Glu Asp Val 
130 135 140 

Glu Glu Val Thr Tyr Arg Ala Leu Arg Asn Lys Asp Phe Gln Glu Val 
145 150 155 160 

Thr Leu Glu Lys Asn Gly Glu Val Val Leu Arg Phe Ala Ala Ala Tyr 
165 170 175 

Gly Phe Arg Asn Ile Gln Asn Met Ile Leu Lys Leu Lys Lys Gly Lys 
180 185 190 

Phe Pro Phe His Phe Val Glu Val Leu Ala Cys Ala Gly Gly Cys Leu 
195 200 205 

Asn Gly Arg Gly Gln Ala Gln Thr Pro Asp Gly His Ala Asp Lys Ala 
210 215 220 

Leu Leu Arg Gln Met Glu Gly Ile Tyr Ala Asp Ile Pro Val Arg Arg 
225 230 235 240 

Pro Glu Ser Ser Ala His Val Gln Glu Leu Tyr Gln Glu Trp Leu Glu 
245 250 255 

Gly Ile Asn Ser Pro Lys Ala Arg Glu Val Leu His Thr Thr Tyr Gln 
260 265 270 

Ser Gln Glu Arg Gly Thr His Ser Leu Asp Ile Lys Trp 
275 280 285 



<210> 167 

<211> 170 

<212> PRT 

<213> Homo sapien 

<400> 167 

Asp Ser Val Ser His Pro Ala Lys Lys Phe Ala Met Ser Ala Ala Lys 
1 5 10 15 

Cys Lys Gln Ser Arg Leu Trp Val Lys Ala Leu Val Met Lys Asn Lys 
20 25 30 

Lys Lys Lys Ile Pro Lys His Phe Leu Thr Leu Val Leu Lys Lys Thr 
35 40 45 

Pro Ser Pro Lys Val Cys Phe Ser Phe Asn Phe Leu Asp Phe Ala Arg 
50 55 60 

Leu Ser Gly Cys Lys Leu Pro Thr Leu Phe Trp Met Asp Arg Asp Lys 
65 70 75 80 

Val Phe Lys Gln Arg Leu Cys Pro Leu His Lys Pro Phe Pro Pro Pro 
85 90 95 

Pro Pro Gln Pro Pro Ala Ala Ile Ile Thr Gly Ala Val Lys Trp Leu 
100 105 110 

Leu Ser Asp Gly His Thr Thr Arg Arg Gln Met Lys Arg Ser Gly Ser 
115 120 125 

Lys Arg Gly Ser Ala Ser Gln Tyr Gln Pro Ala Val Pro Arg Gly Gly 
130 135 140 

Ser Ala Gly Arg Thr Val Phe Pro Arg Gln Ala Ala Met Pro Pro Pro 
145 150 155 160 

Thr Ala Lys Ala Pro Lys Ala Thr Ser Val 
165 170 



<210> 168 

<211> 159 

<212> PRT 

<213> Homo sapien 

<400> 168 

Met Ser Ala Ala Lys Cys Lys Gln Ser Arg Leu Trp Val Lys Ala Leu 
1 5 10 15 

Val Met Lys Asn Lys Lys Lys Lys Ile Pro Lys His Phe Leu Thr Leu 
20 25 30 

Val Leu Lys Lys Thr Pro Ser Pro Lys Val Cys Phe Ser Phe Asn Phe 
35 40 45 

Leu Asp Phe Ala Arg Leu Ser Gly Cys Lys Leu Pro Thr Leu Phe Trp 
50 55 60 

Met Asp Arg Asp Lys Val Phe Lys Gln Arg Leu Cys Pro Leu His Lys 
65 70 75 80 

Pro Phe Pro Pro Pro Pro Pro Gln Pro Pro Ala Ala Ile Ile Thr Gly 
85 90 95 

Ala Val Lys Trp Leu Leu Ser Asp Gly His Thr Thr Arg Arg Gln Met 
100 105 110 

Lys Arg Ser Gly Ser Lys Arg Gly Ser Ala Ser Gln Tyr Gln Pro Ala 
115 120 125 

Val Pro Arg Gly Gly Ser Ala Gly Arg Thr Val Phe Pro Arg Gln Ala 
130 135 140 

Ala Met Pro Pro Pro Thr Ala Lys Ala Pro Lys Gly Asn Ile Arg 
145 150 155 



<210> 169 
<211> 170 
<212> PRT 
<213> Homo sapien 
<400> 169 

Asp Ser Val Ser His Pro Ala Lys Lys Phe Ala Met Ser Ala Ala Lys 
1 5 10 15 

Cys Lys Gln Ser Arg Leu Trp Val Lys Ala Leu Val Met Lys Asn Lys 
20 25 30 

Lys Lys Lys Ile Pro Lys His Phe Leu Thr Leu Val Leu Lys Lys Thr 
35 40 45 

Pro Ser Pro Lys Val Cys Phe Ser Phe Asn Phe Leu Asp Phe Ala Arg 
50 55 60 

Leu Ser Gly Cys Lys Leu Pro Thr Leu Phe Trp Met Asp Arg Asp Lys 
65 70 75 80 

Val Phe Lys Gln Arg Leu Cys Pro Leu His Lys Pro Phe Pro Pro Pro 
85 90 95 

Pro Pro Gln Pro Pro Ala Ala Ile Ile Thr Gly Ala Val Lys Trp Leu 
100 105 110 

Leu Ser Asp Gly His Thr Thr Arg Arg Gln Met Lys Arg Ser Gly Ser 
115 120 125 

Lys Arg Gly Ser Ala Ser Gln Tyr Gln Pro Ala Val Pro Arg Gly Gly 
130 135 140 

Ser Ala Gly Arg Thr Val Phe Pro Arg Gln Ala Ala Met Pro Pro Pro 
145 150 155 160 

Thr Ala Lys Ala Pro Lys Ala Thr Ser Val 
165 170 



<210> 170 

<211> 255 

<212> PRT 

<213> Homo sapien 

<400> 170 

Gln Leu Leu Arg Asp Pro Asn Val Ala Leu Glu Leu Ser Ala Met Cys 
1 5 10 15 

Ser Thr Val Pro Trp Arg Arg Thr Leu Arg Glu Gly Gln Pro Cys His 
20 25 30 

Leu Ser Leu Pro His His His Ser Pro Pro Pro Ile Lys Leu Gln Ser 
35 40 45 

Gly Cys Trp Thr Pro Leu Gly Ala Val Ser Ala His His Pro Leu Cys 
50 55 60 

Ala Ala Thr Trp Ser Gln Ala His Cys Pro Leu Ala Gly Arg Gly Pro 
65 70 75 80 

Ser Arg Arg Arg Cys Gly Leu His Arg Ala Pro Ser Thr Lys Glu Ser 
85 90 95 

Ala Asn Ala Ser Ala Gly Pro Arg Ala Met Ala Ser Leu Pro Gln Leu 
100 105 110 

Met Ala Ala Pro Thr Ser Ser Cys Thr Ser Leu Met Trp Lys Gly Ser 
115 120 125 

Met Ser Gln Trp Lys Ala Thr Arg Ser Pro Ile Lys Cys Ala Pro Ser 
130 135 140 

His Pro Arg Met Arg Ser Cys Arg Pro Trp Arg Ser Ser Ser Leu Thr 
145 150 155 160 

Trp His Gln Ala Pro Ser Met Arg Pro Gly Leu Asp Met Ser Ser Ala 
165 170 175 

Pro Arg Arg Trp Trp Lys His Pro Leu Ser Cys Ala Cys Gly Arg Leu 
180 185 190 

Cys Gly Glu Glu Ala Ala Asp Thr Gly Asp Asp Ile Leu Pro His Glu 
195 200 205 

Thr Gly Leu Gln Pro Gly Met Val Pro Leu Lys Tyr Leu Leu Glu Glu 
210 215 220 

Gly Val Trp Gly Ala Gly Val Gly Cys Gly Val Phe Pro Ala Ile Ser 
225 230 235 240 

Thr Ala Tyr Asp His Cys Asn Asn Leu Ser Pro Ser Glu Glu His 
245 250 255 



<210> 171 
<211> 147 
<212> PRT 
<213> Homo sapien 
<400> 171 

Met Ser Ser Glu Pro Pro Pro Pro Pro Gln Pro Pro Thr His Gln Ala 
1 5 10 15 

Ser Val Gly Leu Leu Asp Thr Pro Arg Ser Arg Glu Arg Ser Pro Ser 
20 25 30 

Pro Leu Arg Gly Asn Val Val Pro Ser Pro Leu Pro Thr Arg Arg Thr 
35 40 45 

Arg Thr Phe Ser Ala Thr Val Arg Ala Ser Gln Gly Pro Val Tyr Lys 
50 55 60 

Gly Val Cys Lys Cys Phe Cys Arg Ser Lys Gly His Gly Phe Ile Thr 
65 70 75 80 

Pro Ala Asp Gly Gly Pro Asp Ile Phe Leu His Ile Ser Asp Val Glu 
85 90 95 

Gly Glu Tyr Val Pro Val Glu Gly Asp Glu Val Thr Tyr Lys Met Cys 
100 105 110 

Ser Ile Pro Pro Lys Asn Glu Lys Leu Gln Ala Val Glu Val Val Ile 
115 120 125 

Thr His Leu Ala Pro Gly Thr Lys His Glu Thr Trp Ser Gly His Val 
130 135 140 

Ile Ser Ser 
145 



<210> 172 

<211> 255 

<212> PRT 

<213> Homo sapien 

<400> 172 

Gln Leu Leu Arg Asp Pro Asn Val Ala Leu Glu Leu Ser Ala Met Cys 
1 5 10 15 

Ser Thr Val Pro Trp Arg Arg Thr Leu Arg Glu Gly Gln Pro Cys His 
20 25 30 

Leu Ser Leu Pro His His His Ser Pro Pro Pro Ile Lys Leu Gln Ser 
35 40 45 

Gly Cys Trp Thr Pro Leu Gly Ala Val Ser Ala His His Pro Leu Cys 
50 55 60 

Ala Ala Thr Trp Ser Gln Ala His Cys Pro Leu Ala Gly Arg Gly Pro 
65 70 75 80 

Ser Arg Arg Arg Cys Gly Leu His Arg Ala Pro Ser Thr Lys Glu Ser 
85 90 95 

Ala Asn Ala Ser Ala Gly Pro Arg Ala Met Ala Ser Leu Pro Gln Leu 
100 105 110 

Met Ala Ala Pro Thr Ser Ser Cys Thr Ser Leu Met Trp Lys Gly Ser 
115 120 125 

Met Ser Gln Trp Lys Ala Thr Arg Ser Pro Ile Lys Cys Ala Pro Ser 
130 135 140 

His Pro Arg Met Arg Ser Cys Arg Pro Trp Arg Ser Ser Ser Leu Thr 
145 150 155 160 

Trp His Gln Ala Pro Ser Met Arg Pro Gly Leu Asp Met Ser Ser Ala 
165 170 175 

Pro Arg Arg Trp Trp Lys His Pro Leu Ser Cys Ala Cys Gly Arg Leu 
180 185 190 

Cys Gly Glu Glu Ala Ala Asp Thr Gly Asp Asp Ile Leu Pro His Glu 
195 200 205 

Thr Gly Leu Gln Pro Gly Met Val Pro Leu Lys Tyr Leu Leu Glu Glu 
210 215 220 

Gly Val Trp Gly Ala Gly Val Gly Cys Gly Val Phe Pro Ala Ile Ser 
225 230 235 240 

Thr Ala Tyr Asp His Cys Asn Asn Leu Ser Pro Ser Glu Glu His 
245 250 255 



<210> 173 

<211> 243 

<212> PRT 

<213> Homo sapien 



<400> 173 

Leu Arg Thr Gly Arg Asn Ser Gly Gly Gly Gln Asn Gly Leu Gln Gly 
1 5 10 15 

Gln Pro Cys His Leu Ser Leu Pro His His His Ser Pro Pro Pro Ile 
20 25 30 

Lys Leu Gln Ser Gly Cys Trp Thr Pro Leu Gly Ala Val Ser Ala His 
35 40 45 

His Pro Leu Cys Ala Ala Thr Trp Ser Gln Ala His Cys Pro Leu Ala 
50 55 60 

Gly Arg Gly Pro Ser Arg Arg Arg Cys Gly Leu His Arg Ala Pro Ser 
65 70 75 80 

Thr Lys Glu Ser Ala Asn Ala Ser Ala Gly Pro Arg Ala Met Ala Ser 
85 90 95 

Leu Pro Gln Leu Met Ala Ala Pro Thr Ser Ser Cys Thr Ser Leu Met 
100 105 110 

Trp Lys Gly Ser Met Ser Gln Trp Lys Ala Thr Arg Ser Pro Ile Lys 
115 120 125 

Cys Ala Pro Ser His Pro Arg Met Arg Ser Cys Arg Pro Trp Arg Ser 
130 135 140 

Ser Ser Leu Thr Trp His Gln Ala Pro Ser Met Arg Pro Gly Leu Asp 
145 150 155 160 

Met Ser Ser Ala Pro Arg Arg Trp Trp Lys His Pro Leu Ser Cys Ala 
165 170 175 

Cys Gly Arg Leu Cys Gly Glu Glu Ala Ala Asp Thr Gly Asp Asp Ile 
180 185 190 

Leu Pro His Glu Thr Gly Leu Gln Pro Gly Met Val Pro Leu Lys Tyr 
195 200 205 

Leu Leu Glu Glu Gly Val Trp Gly Ala Gly Val Gly Cys Gly Val Phe 
210 215 220 

Pro Ala Ile Ser Thr Ala Tyr Asp His Cys Asn Asn Leu Ser Pro Ser 
225 230 235 240 

Glu Glu His 

<210> 174 

<211> 147 

<212> PRT 

<213> Homo sapien 

<400> 174 

Met Ser Ser Glu Pro Pro Pro Pro Pro Gln Pro Pro Thr His Gln Ala 
1 5 10 15 

Ser Val Gly Leu Leu Asp Thr Pro Arg Ser Arg Glu Arg Ser Pro Ser 
20 25 30 

Pro Leu Arg Gly Asn Val Val Pro Ser Pro Leu Pro Thr Arg Arg Thr 
35 40 45 

Arg Thr Phe Ser Ala Thr Val Arg Ala Ser Gln Gly Pro Val Tyr Lys 
50 55 60 

Gly Val Cys Lys Cys Phe Cys Arg Ser Lys Gly His Gly Phe Ile Thr 
65 70 75 80 

Pro Ala Asp Gly Gly Pro Asp Ile Phe Leu His Ile Ser Asp Val Glu 
85 90 95 

Gly Glu Tyr Val Pro Val Glu Gly Asp Glu Val Thr Tyr Lys Met Cys 
100 105 110 

Ser Ile Pro Pro Lys Asn Glu Lys Leu Gln Ala Val Glu Val Val Ile 
115 120 125 

Thr His Leu Ala Pro Gly Thr Lys His Glu Thr Trp Ser Gly His Val 
130 135 140 

Ile Ser Ser 
145 



<210> 175 

<211> 202 

<212> PRT 

<213> Homo sapien 

<400> 175 



Trp Gly Thr Ala Val Gly Arg Gly Trp Asn Glu Leu Cys Val Pro Arg 
1 5 10 15 

Ser Ala Asp Gly Ala Pro Asp Ile Ser Pro Ser Leu Ser Gly Arg Cys 
20 25 30 

Gly Leu His Arg Ala Pro Ser Thr Lys Glu Ser Ala Asn Ala Ser Ala 
35 40 45 

Gly Pro Arg Ala Met Ala Ser Leu Pro Gln Leu Met Ala Ala Pro Thr 
50 55 60 

Ser Ser Cys Thr Ser Leu Met Trp Lys Gly Ser Met Ser Gln Trp Lys 
65 70 75 80 

Ala Thr Arg Ser Pro Ile Lys Cys Ala Pro Ser His Pro Arg Met Arg 
85 90 95 

Ser Cys Arg Pro Trp Arg Ser Ser Ser Leu Thr Trp His Gln Ala Pro 
100 105 110 

Ser Met Arg Pro Gly Leu Asp Met Ser Ser Ala Pro Arg Arg Trp Trp 
115 120 125 

Lys His Pro Leu Ser Cys Ala Cys Gly Arg Leu Cys Gly Glu Glu Ala 
130 135 140 

Ala Asp Thr Gly Asp Asp Ile Leu Pro His Glu Thr Gly Leu Gln Pro 
145 150 155 160 

Gly Met Val Pro Leu Lys Tyr Leu Leu Glu Glu Gly Val Trp Gly Ala 
165 170 175 

Gly Val Gly Cys Gly Val Phe Pro Ala Ile Ser Thr Ala Tyr Asp His 
180 185 190 

Cys Asn Asn Leu Ser Pro Ser Glu Glu His 
195 200 



<210> 176 

<211> 138 

<212> PRT 

<213> Homo sapien 

<400> 176 

Met Ala Ser Leu Pro Gln Leu Met Ala Ala Pro Thr Ser Ser Cys Thr 
1 5 10 15 

Ser Leu Met Trp Lys Gly Ser Met Ser Gln Trp Lys Ala Thr Arg Ser 
20 25 30 

Pro Ile Lys Cys Ala Pro Ser His Pro Arg Met Arg Ser Cys Arg Pro 
35 40 45 

Trp Arg Ser Ser Ser Leu Thr Trp His Gln Ala Pro Ser Met Arg Pro 
50 55 60 

Gly Leu Asp Met Ser Ser Ala Pro Arg Arg Trp Trp Lys His Pro Leu 
65 70 75 80 

Ser Cys Ala Cys Gly Arg Leu Cys Gly Glu Glu Ala Ala Asp Thr Gly 
85 90 95 

Asp Asp Ile Leu Pro His Glu Thr Gly Leu Gln Pro Gly Met Val Pro 
100 105 110 

Leu Lys Tyr Leu Leu Glu Glu Gly Val Trp Gly Gly Arg Cys Gly Val 
115 120 125 

Trp Gly Val Pro Gly His Gln His Ser Leu 
130 135 



<210> 177 

<211> 185 

<212> PRT 

<213> Homo sapien 

<400> 177 

Met Ser Ser Glu Pro Pro Pro Pro Pro Gln Pro Pro Thr His Gln Ala 
1 5 10 15 

Ser Val Gly Leu Leu Asp Thr Pro Arg Ser Arg Glu Arg Ser Pro Ser 
20 25 30 

Pro Ser Thr Lys Glu Ser Ala Asn Ala Ser Ala Gly Pro Arg Ala Met 
35 40 45 

Ala Ser Leu Pro Gln Leu Met Ala Ala Pro Thr Ser Ser Cys Thr Ser 
50 55 60 

Leu Met Trp Lys Gly Ser Met Ser Gln Trp Lys Ala Thr Arg Ser Pro 
65 70 75 80 

Ile Lys Cys Ala Pro Ser His Pro Arg Met Arg Ser Cys Arg Pro Trp 
85 90 95 




Arg Ser Ser Ser Leu Thr Trp His Gln Ala Pro Ser Met Arg Pro Gly 
100 105 110 

Leu Asp Met Ser Ser Ala Pro Arg Arg Trp Trp Lys His Pro Leu Ser 
115 120 125 

Cys Ala Cys Gly Arg Leu Cys Gly Glu Glu Ala Ala Asp Thr Gly Asp 
130 135 140 

Asp Ile Leu Pro His Glu Thr Gly Leu Gln Pro Gly Met Val Pro Leu 
145 150 155 160 

Lys Tyr Leu Leu Glu Glu Gly Val Trp Gly Gly Arg Cys Gly Val Trp 
165 170 175 

Gly Val Pro Gly His Gln His Ser Leu 
180 185 



<210> 178 

<211> 265 

<212> PRT 

<213> Homo sapien 

<400> 178 

Ser Phe Pro Pro Ala His Leu Phe Ser Ala Cys Arg Gly Ser Ser Ser 
1 5 10 15 

Arg Pro Pro Arg Cys Phe Cys Leu Trp Ala Gly Ala Leu Asp Gly Gly 
20 25 30 

Leu Ala Gly Arg Trp Gly Glu Ala Arg Gly Ala Ser His Ala Gly Ser 
35 40 45 

Arg Ala Thr Pro Arg Arg Ala Trp Pro Arg Gln Leu Trp Leu Glu Val 
50 55 60 

Gly Thr Ser Ala Met Ser Ser Glu Pro Pro Pro Pro Pro Gln Pro Pro 
65 70 75 80 

Thr His Gln Ala Ser Val Gly Leu Leu Asp Thr Pro Arg Ser Arg Glu 
85 90 95 

Arg Ser Pro Ser Pro Ser Thr Lys Glu Ser Ala Asn Ala Ser Ala Gly 
100 105 110 

Pro Arg Ala Met Ala Ser Leu Pro Gln Leu Met Ala Ala Pro Thr Ser 
115 120 125 

Ser Cys Thr Ser Leu Met Trp Lys Gly Ser Met Ser Gln Trp Lys Ala 
130 135 140 

Thr Arg Ser Pro Ile Lys Cys Ala Pro Ser His Pro Arg Met Arg Ser 
145 150 155 160 

Cys Arg Pro Trp Arg Ser Ser Ser Leu Thr Trp His Gln Ala Pro Ser 
165 170 175 

Met Arg Pro Gly Leu Asp Met Ser Ser Ala Pro Arg Arg Trp Trp Lys 
180 185 190 

His Pro Leu Ser Cys Ala Cys Gly Arg Leu Cys Gly Glu Glu Ala Ala 
195 200 205 

Asp Thr Gly Asp Asp Ile Leu Pro His Glu Thr Gly Leu Gln Pro Gly 
210 215 220 

Met Val Pro Leu Lys Tyr Leu Leu Glu Glu Gly Val Trp Gly Ala Gly 
225 230 235 240 

Val Gly Cys Gly Val Phe Pro Ala Ile Ser Thr Ala Tyr Asp His Cys 
245 250 255 

Asn Asn Leu Ser Pro Ser Glu Glu His 
260 265 



<210> 179 

<211> 201 

<212> PRT 

<213> Homo sapien 

<400> 179 

Met Cys Ser Thr Val Pro Trp Arg Arg Thr Leu Arg Glu Gly Gln Phe 
1 5 10 15 

Leu Pro Leu Leu Pro Cys Gly Val Trp Leu Pro Ala Ala Ser Gly Arg 
20 25 30 

Val Arg Gly Val Ala Glu Phe Gly Ser Arg Trp Leu Ala Leu Lys Ser 
35 40 45 

Pro Trp Leu Trp Val Phe Phe Phe Glu Thr Glu Ser Cys Ser Val Ala 
50 55 60 

Gln Ala Gly Val Gln Trp Cys Asp Leu Ser Ser Leu Glu Pro Pro Pro 
65 70 75 80 

Pro Arg Phe Lys Gln Phe Ser Cys Leu Ser Leu Gln Val Asp Gly Ile 
85 90 95 

Thr Gly Ala Cys His His Ala Gln Leu Ile Phe Val Phe Val Leu Glu 
100 105 110 

Thr Gly Phe Pro His Val Gly Gln Ala Ser Leu Glu Leu Leu Thr Leu 
115 120 125 

Ser Asp Pro Pro Ala Ser Ala Ser Gln Ser Ala Gly Ile Ala Gly Val 
130 135 140 

Ser His Cys Ala Arg Pro Leu Ala Leu Gly Phe Leu Leu Thr Phe Leu 
145 150 155 160 

Leu Pro Ser His Lys Tyr Tyr Val Ser Gln Met Cys Arg Asp Pro Cys 
165 170 175 

Leu Val Leu Gly Thr Gln Arg Gly Pro Met Pro Ser Trp Ser Trp Gly 
180 185 190 

Arg Met Trp His Phe His Glu Glu Leu 
195 200 



<210> 180 

<211> 159 

<212> PRT 

<213> Homo sapien 

<400> 180 

Met Ala Gln Gly Leu Ala Val Arg Glu Met Thr Gly Met Thr Lys Phe 
1 5 10 15 

Lys Pro Tyr Phe Ile Ser Ala Thr Ser Glu Ile Leu Ser Gln Lys Cys 
20 25 30 

Ile Asn Thr Asn Val Leu Phe Leu Ser Leu Ser Asp Asn His Gly His 
35 40 45 

Ile Asp Pro Ser Leu Arg Leu Ile Trp Asp Leu Ala Phe Leu Gly Ser 
50 55 60 

Ser Tyr Val Met Trp Glu Met Thr Thr Gln Val Ser His Tyr Tyr Leu 
65 70 75 80 

Ala Gln Leu Thr Ser Val Arg Gln Trp Lys Thr Asn Asp Asp Thr Ile 
85 90 95 

Asp Phe Asp Tyr Thr Val Leu Leu His Glu Leu Ser Thr Gln Glu Ile 
100 105 110 

Ile Pro Cys Arg Ile His Leu Val Trp Tyr Pro Gly Lys Pro Leu Lys 
115 120 125 

Val Lys Tyr His Cys Gln Glu Leu Gln Thr Pro Glu Glu Ala Ser Gly 
130 135 140 

Thr Glu Glu Gly Ser Ala Val Val Pro Thr Glu Leu Ser Asn Phe 
145 150 155 



<210> 181 

<211> 128 

<212> PRT 

<213> Homo sapien 

<400> 181 

Met Arg Ile His Asp Phe His Ile Leu Lys Gln Asn Thr Thr Lys Asn 
1 5 10 15 

Arg Glu Ala Glu Ile Glu Lys Ala Val Gly Asp Thr Arg His Pro Phe 
20 25 30 

Lys Thr Arg Met Tyr Cys Ile Val Ile Thr Leu Asn Thr Thr Ile Phe 
35 40 45 

Ile Thr Leu Thr Leu Phe Ser Pro Ser Arg Lys Thr Asn Asp Asp Thr 
50 55 60 

Ile Asp Phe Asp Tyr Thr Val Leu Leu His Glu Leu Ser Thr Gln Glu 
65 70 75 80 

Ile Ile Pro Cys Arg Ile His Leu Val Trp Tyr Pro Gly Lys Pro Leu 
85 90 95 

Lys Val Lys Tyr His Cys Gln Glu Leu Gln Thr Pro Glu Glu Ala Ser 
100 105 110 

Gly Thr Glu Glu Gly Ser Ala Val Val Pro Thr Glu Leu Ser Asn Phe 
115 120 125 

<210> 182 
<211> 224 
<212> PRT 
<213> Homo sapien 

<400> 182 

Met Asp Gly Asp Gln Thr Glu Glu Arg Met Met Lys Met Met Val His 
1 5 10 15 

Gln Arg Pro Leu Pro Gln Pro Ala Leu Leu Pro Met Ser Ser Asn Thr 
20 25 30 

Phe Pro Ser Arg Ser Thr Lys Pro Ser Pro Met Asn Pro Leu Pro Ser 
35 40 45 

Ser His Met Pro Gly Ala Phe Ser Glu Ser Asn Ser Ser Phe Pro Gln 
50 55 60 

Ser Ala Ser Leu Pro Pro Tyr Phe Ser Gln Gly Pro Ser Asn Arg Pro 
65 70 75 80 

Pro Ile Arg Ala Glu Gly Arg Asn Phe Pro Leu Pro Leu Pro Asn Lys 
85 90 95 

Pro Arg Pro Pro Ser Pro Ala Glu Glu Glu Asn Ser Leu Asn Glu Glu 
100 105 110 

Trp Tyr Val Ser Tyr Ile Thr Arg Pro Glu Ala Glu Ala Ala Leu Arg 
115 120 125 

Lys Ile Asn Gln Asp Gly Thr Phe Leu Val Arg Asp Ser Ser Lys Lys 
130 135 140 

Thr Thr Thr Asn Pro Tyr Val Leu Met Val Leu Tyr Lys Asp Lys Val 
145 150 155 160 

Tyr Asn Ile Gln Ile Arg Tyr Gln Lys Glu Ser Gln Val Tyr Leu Leu 
165 170 175 

Gly Thr Gly Leu Arg Gly Lys Glu Asp Phe Leu Ser Val Ser Asp Ile 
180 185 190 

Ile Asp Tyr Phe Arg Lys Met Pro Leu Leu Leu Ile Asp Gly Lys Asn 
195 200 205 

Arg Gly Ser Arg Tyr Gln Cys Thr Leu Thr His Ala Ala Gly Tyr Pro 
210 215 220 

<210> 183 

<211> 230 

<212> PRT 

<213> Homo sapien 

<400> 183 

Met Phe Leu Ser Asp Ser Trp Gly Leu Pro Ile Ser Phe Ile Val Phe
1 5 10 15

Tyr Phe Cys Thr Val His Gln Arg Pro Leu Pro Gln Pro Ala Leu Leu
20 25 30

Pro Met Ser Ser Asn Thr Phe Pro Ser Arg Ser Thr Lys Pro Ser Pro
35 40 45

Met Asn Pro Leu Pro Ser Ser His Met Pro Gly Ala Phe Ser Glu Ser
50 55 60

Asn Ser Ser Phe Pro Gln Ser Ala Ser Leu Pro Pro Tyr Phe Ser Gln
65 70 75 80

Gly Pro Ser Asn Arg Pro Pro Ile Arg Ala Glu Gly Arg Asn Phe Pro
85 90 95

Leu Pro Leu Pro Asn Lys Pro Arg Pro Pro Ser Pro Ala Glu Glu Glu
100 105 110

Asn Ser Leu Asn Glu Glu Trp Tyr Val Ser Tyr Ile Thr Arg Pro Glu
115 120 125

Ala Glu Ala Ala Leu Arg Lys Ile Asn Gln Asp Gly Thr Phe Leu Val
130 135 140

Arg Asp Ser Ser Lys Lys Thr Thr Thr Asn Pro Tyr Val Leu Met Val
145 150 155 160

Leu Tyr Lys Asp Lys Val Tyr Asn Ile Gln Ile Arg Tyr Gln Lys Glu
165 170 175

Ser Gln Val Tyr Leu Leu Gly Thr Gly Leu Arg Gly Lys Glu Asp Phe
180 185 190

Leu Ser Val Ser Asp Ile Ile Asp Tyr Phe Arg Lys Met Pro Leu Leu
195 200 205

Leu Ile Asp Gly Lys Asn Arg Gly Ser Arg Tyr Gln Cys Thr Leu Thr
210 215 220

His Ala Ala Gly Tyr Pro
225 230



<210> 184 

<211> 225 

<212> PRT 

<213> Homo sapien 

<400> 184 

Gln Leu Gly Pro Ala Asn Leu Ile His Cys Phe Leu Leu Leu His Val
1 5 10 15

His Gln Arg Pro Leu Pro Gln Pro Ala Leu Leu Pro Met Ser Ser Asn
20 25 30

Thr Phe Pro Ser Arg Ser Thr Lys Pro Ser Pro Met Asn Pro Leu Pro
35 40 45

Ser Ser His Met Pro Gly Ala Phe Ser Glu Ser Asn Ser Ser Phe Pro
50 55 60

Gln Ser Ala Ser Leu Pro Pro Tyr Phe Ser Gln Gly Pro Ser Asn Arg
65 70 75 80

Pro Pro Ile Arg Ala Glu Gly Arg Asn Phe Pro Leu Pro Leu Pro Asn
85 90 95

Lys Pro Arg Pro Pro Ser Pro Ala Glu Glu Glu Asn Ser Leu Asn Glu
100 105 110

Glu Trp Tyr Val Ser Tyr Ile Thr Arg Pro Glu Ala Glu Ala Ala Leu
115 120 125

Arg Lys Ile Asn Gln Asp Gly Thr Phe Leu Val Arg Asp Ser Ser Lys
130 135 140

Lys Thr Thr Thr Asn Pro Tyr Val Leu Met Val Leu Tyr Lys Asp Lys
145 150 155 160

Val Tyr Asn Ile Gln Ile Arg Tyr Gln Lys Glu Ser Gln Val Tyr Leu
165 170 175

Leu Gly Thr Gly Leu Arg Gly Lys Glu Asp Phe Leu Ser Val Ser Asp
180 185 190

Ile Ile Asp Tyr Phe Arg Lys Met Pro Leu Leu Leu Ile Asp Gly Lys
195 200 205

Asn Arg Gly Ser Arg Tyr Gln Cys Thr Leu Thr His Ala Ala Gly Tyr
210 215 220

Pro
225



<210> 185 

<211> 1085 

<212> PRT 

<213> Homo sapien 

<400> 185 

Met Ala Ala Ser Thr Gly Tyr Val Arg Leu Trp Gly Ala Ala Arg Cys
1 5 10 15

Trp Val Leu Arg Arg Pro Met Leu Ala Ala Ala Gly Gly Arg Val Pro
20 25 30

Thr Ala Ala Gly Ala Trp Leu Leu Arg Gly Gln Arg Thr Cys Asp Ala
35 40 45

Ser Pro Pro Trp Ala Leu Trp Gly Arg Gly Pro Ala Ile Gly Gly Gln
50 55 60

Trp Arg Gly Phe Trp Glu Ala Ser Ser Arg Gly Gly Gly Ala Phe Ser
65 70 75 80

Gly Gly Glu Asp Ala Ser Glu Gly Gly Ala Glu Glu Gly Ala Gly Gly
85 90 95

Ala Gly Gly Ser Ala Gly Ala Gly Glu Gly Pro Val Ile Thr Ala Leu
100 105 110

Thr Pro Met Thr Ile Pro Asp Val Phe Pro His Leu Pro Leu Ile Ala
115 120 125

Ile Thr Arg Asn Pro Val Phe Pro Arg Phe Ile Lys Ile Ile Glu Val
130 135 140

Lys Asn Lys Lys Leu Val Glu Leu Leu Arg Arg Lys Val Arg Leu Ala
145 150 155 160

Gln Pro Tyr Val Gly Val Phe Leu Lys Arg Asp Asp Ser Asn Glu Ser
165 170 175

Asp Val Val Glu Ser Leu Asp Glu Ile Tyr His Thr Gly Thr Phe Ala
180 185 190

Gln Ile His Glu Met Gln Asp Leu Gly Asp Lys Leu Arg Met Ile Val
195 200 205

Met Gly His Arg Arg Val His Ile Ser Arg Gln Leu Glu Val Glu Pro
210 215 220

Glu Glu Pro Glu Ala Glu Asn Lys His Lys Pro Arg Arg Lys Ser Lys
225 230 235 240

Arg Gly Lys Lys Glu Ala Glu Asp Glu Leu Ser Ala Arg His Pro Ala
245 250 255

Glu Leu Ala Met Glu Pro Thr Pro Glu Leu Pro Ala Glu Val Leu Met
260 265 270

Val Glu Val Glu Asn Val Val His Glu Asp Phe Gln Val Thr Glu Glu
275 280 285

Val Lys Ala Leu Thr Ala Glu Ile Val Lys Thr Ile Arg Asp Ile Ile
290 295 300

Ala Leu Asn Pro Leu Tyr Arg Glu Ser Val Leu Gln Met Met Gln Ala
305 310 315 320

Gly Gln Arg Val Val Asp Asn Pro Ile Tyr Leu Ser Asp Met Gly Ala
325 330 335

Ala Leu Thr Gly Ala Glu Ser His Glu Leu Gln Asp Val Leu Glu Glu
340 345 350

Thr Asn Ile Pro Lys Arg Leu Tyr Lys Ala Leu Ser Leu Leu Lys Lys
355 360 365

Glu Phe Glu Leu Ser Lys Leu Gln Gln Arg Leu Gly Arg Glu Val Glu
370 375 380

Glu Lys Ile Lys Gln Thr His Arg Lys Tyr Leu Leu Gln Glu Gln Leu
385 390 395 400

Lys Ile Ile Lys Lys Glu Leu Gly Leu Glu Lys Asp Asp Lys Asp Ala
405 410 415

Ile Glu Glu Lys Phe Arg Glu Arg Leu Lys Glu Leu Val Val Pro Lys
420 425 430

His Val Met Asp Val Val Asp Glu Glu Leu Ser Lys Leu Gly Leu Leu
435 440 445

Asp Asn His Ser Ser Glu Phe Asn Val Thr Arg Asn Tyr Leu Asp Trp
450 455 460

Leu Thr Ser Ile Pro Trp Gly Lys Tyr Ser Asn Glu Asn Leu Asp Leu
465 470 475 480

Ala Arg Ala Gln Ala Val Leu Glu Glu Asp His Tyr Gly Met Glu Asp
485 490 495

Val Lys Lys Arg Ile Leu Glu Phe Ile Ala Val Ser Gln Leu Arg Gly
500 505 510

Ser Thr Gln Gly Lys Ile Leu Cys Phe Tyr Gly Pro Pro Gly Val Gly
515 520 525

Lys Thr Ser Ile Ala Arg Ser Ile Ala Arg Ala Leu Asn Arg Glu Tyr
530 535 540

Phe Arg Phe Ser Val Gly Gly Met Thr Asp Val Ala Glu Ile Lys Gly
545 550 555 560

His Arg Arg Thr Tyr Val Gly Ala Met Pro Gly Lys Ile Ile Gln Cys
565 570 575

Leu Lys Lys Thr Lys Thr Glu Asn Pro Leu Ile Leu Ile Asp Glu Val
580 585 590

Arg Ala Glu Arg Pro Gly Asp Pro Leu Thr Arg Gln Cys Arg Val Gln
595 600 605

Asp Glu Ala Glu Ala Phe Arg Val Leu Gly Ser Pro Ser Gln Leu Arg
610 615 620

Gly His Arg Arg Asp Ala Gly Pro Asp Gly Thr Asp Gly Ser Leu Pro
625 630 635 640

Thr His Ser His Ala Gln Gln Ala Ala Arg Pro Arg Gln Gly Val Gly
645 650 655

Ala Ala Leu Thr Arg Gly Leu Pro Ser Pro Asp Pro Gly Val Ser Pro
660 665 670

Leu Gly Gly Leu Ala Arg Gly Thr Ala Arg Gly Thr Thr Cys Cys Leu
675 680 685

Leu Pro Gln Val Asp Lys Ile Gly Arg Gly Tyr Gln Gly Asp Pro Ser
690 695 700

Ser Ala Leu Leu Glu Leu Leu Asp Pro Glu Gln Asn Ala Asn Phe Leu
705 710 715 720

Asp His Tyr Leu Asp Val Pro Val Asp Leu Ser Lys Val Gly Gly Leu
725 730 735

Ser Gly Ala Trp Ala Cys Trp Gly Gly Tyr Ala Ala Ser Pro Pro Ala
740 745 750

Pro Cys Arg Arg Pro Gln Val Leu Phe Ile Cys Thr Ala Asn Val Thr
755 760 765

Asp Thr Ile Pro Glu Pro Leu Arg Asp Arg Met Glu Met Ile Asn Val
770 775 780

Ser Gly Tyr Val Ala Gln Glu Lys Leu Ala Ile Ala Glu Arg Tyr Leu
785 790 795 800

Val Pro Gln Ala Arg Ala Leu Cys Gly Leu Asp Glu Ser Lys Ala Lys
805 810 815

Leu Ser Ser Asp Val Leu Thr Leu Leu Ile Lys Gln Tyr Cys Arg Glu
820 825 830

Ser Gly Val Arg Asn Leu Gln Lys Gln Val Glu Lys Val Leu Arg Lys
835 840 845

Ser Ala Tyr Lys Ile Val Ser Gly Glu Ala Glu Ser Val Glu Val Thr
850 855 860

Pro Glu Asn Leu Gln Asp Phe Val Gly Lys Pro Val Phe Thr Val Glu
865 870 875 880

Arg Met Tyr Asp Val Thr Pro Pro Gly Val Val Met Gly Leu Ala Trp
885 890 895

Thr Ala Met Gly Gly Ser Thr Leu Phe Val Glu Thr Ser Leu Arg Arg
900 905 910

Pro Gln Asp Lys Asp Ala Lys Gly Asp Lys Asp Gly Ser Leu Glu Val
915 920 925

Thr Gly Gln Leu Gly Glu Val Met Lys Glu Ser Ala Arg Ile Ala Tyr
930 935 940

Thr Phe Ala Arg Ala Phe Leu Met Gln His Ala Pro Ala Asn Asp Tyr
945 950 955 960

Leu Val Thr Ser His Ile His Leu His Val Pro Glu Gly Ala Thr Pro
965 970 975

Lys Asp Gly Pro Ser Ala Gly Cys Thr Ile Val Thr Ala Leu Leu Ser
980 985 990

Leu Ala Met Gly Arg Pro Val Arg Gln Asn Leu Ala Met Thr Gly Glu
995 1000 1005

Val Ser Leu Thr Gly Lys Ile Leu Pro Val Gly Gly Ile Lys Glu
1010 1015 1020

Lys Thr Ile Ala Ala Lys Arg Ala Gly Val Thr Cys Ile Val Leu
1025 1030 1035

Pro Ala Glu Asn Lys Lys Asp Phe Tyr Asp Leu Ala Ala Phe Ile
1040 1045 1050

Thr Glu Gly Leu Glu Val His Phe Val Glu His Tyr Arg Glu Ile
1055 1060 1065

Phe Asp Ile Ala Phe Pro Asp Glu Gln Ala Glu Ala Leu Ala Val
1070 1075 1080

Glu Arg
1085



<210> 186 

<211> 668 

<212> PRT 

<213> Homo sapien 

<400> 186 



Asn Ser Ala Pro Ser Ser Pro Arg Arg Pro Ser Ser Leu Lys Arg Leu
1 5 10 15

Phe Arg Ala Thr Arg Pro Ser Gly Thr Glu Ala Arg Ala Gly Arg His
20 25 30

Val Arg Phe Ala Ala Ser Gly Asn Asp Ala Ser Cys Val Ser Arg Gln
35 40 45

Tyr Gly Arg Ala Met Ala Ala Ser Thr Gly Tyr Val Arg Leu Trp Gly
50 55 60

Ala Ala Arg Cys Trp Val Leu Arg Arg Pro Met Leu Ala Ala Ala Gly
65 70 75 80

Gly Arg Val Pro Thr Ala Ala Gly Ala Trp Leu Leu Arg Gly Gln Arg
85 90 95

Thr Cys Asp Ala Ser Pro Pro Trp Ala Leu Trp Gly Arg Gly Pro Ala
100 105 110

Ile Gly Gly Gln Trp Arg Gly Phe Trp Glu Ala Ser Ser Arg Gly Gly
115 120 125

Gly Ala Phe Ser Gly Gly Glu Asp Ala Ser Glu Gly Gly Ala Glu Glu
130 135 140

Gly Ala Gly Gly Ala Gly Gly Ser Ala Gly Ala Gly Glu Gly Pro Val
145 150 155 160

Ile Thr Ala Leu Thr Pro Met Thr Ile Pro Asp Val Phe Pro His Leu
165 170 175

Pro Leu Ile Ala Ile Thr Arg Asn Pro Val Phe Pro Arg Phe Ile Lys
180 185 190

Ile Ile Glu Val Lys Asn Lys Lys Leu Val Glu Leu Leu Arg Arg Lys
195 200 205

Val Arg Leu Ala Gln Pro Tyr Val Gly Val Phe Leu Lys Arg Asp Asp
210 215 220

Ser Asn Glu Ser Asp Val Val Glu Ser Leu Asp Glu Ile Tyr His Thr
225 230 235 240

Gly Thr Phe Ala Gln Ile His Glu Met Gln Asp Leu Gly Asp Lys Leu
245 250 255

Arg Met Ile Val Met Gly His Arg Arg Val His Ile Ser Arg Gln Leu
260 265 270

Glu Val Glu Pro Glu Glu Pro Glu Ala Glu Asn Lys His Lys Pro Arg
275 280 285

Arg Lys Ser Lys Arg Gly Lys Lys Glu Ala Glu Asp Glu Leu Ser Ala
290 295 300

Arg His Pro Ala Glu Leu Ala Met Glu Pro Thr Pro Glu Leu Pro Ala
305 310 315 320

Glu Val Leu Met Val Glu Val Glu Asn Val Val His Glu Asp Phe Gln
325 330 335

Val Thr Glu Glu Val Lys Ala Leu Thr Ala Glu Ile Val Lys Thr Ile
340 345 350

Arg Asp Ile Ile Ala Leu Asn Pro Leu Tyr Arg Glu Ser Val Leu Gln
355 360 365

Met Met Gln Ala Gly Gln Arg Val Val Asp Asn Pro Ile Tyr Leu Ser
370 375 380

Asp Met Gly Ala Ala Leu Thr Gly Ala Glu Ser His Glu Leu Gln Asp
385 390 395 400

Val Leu Glu Glu Thr Asn Ile Pro Lys Arg Leu Tyr Lys Ala Leu Ser
405 410 415

Leu Leu Lys Lys Glu Phe Glu Leu Ser Lys Leu Gln Gln Arg Leu Gly
420 425 430

Arg Glu Val Glu Glu Lys Ile Lys Gln Thr His Arg Lys Tyr Leu Leu
435 440 445

Gln Glu Gln Leu Lys Ile Ile Lys Lys Glu Leu Gly Leu Glu Lys Asp
450 455 460

Asp Lys Asp Ala Ile Glu Glu Lys Phe Arg Glu Arg Leu Lys Glu Leu
465 470 475 480

Val Val Pro Lys His Val Met Asp Val Val Asp Glu Glu Leu Ser Lys
485 490 495




Leu Gly Leu Leu Asp Asn His Ser Ser Glu Phe Asn Val Thr Arg Asn
500 505 510

Tyr Leu Asp Trp Leu Thr Ser Ile Pro Trp Gly Lys Tyr Ser Asn Glu
515 520 525

Asn Leu Asp Leu Ala Arg Ala Gln Ala Val Leu Glu Glu Asp His Tyr
530 535 540

Gly Met Glu Asp Val Lys Lys Arg Ile Leu Glu Phe Ile Ala Val Ser
545 550 555 560

Gln Leu Arg Gly Ser Thr Gln Gly Lys Ile Leu Cys Phe Tyr Gly Pro
565 570 575

Pro Gly Val Gly Lys Thr Ser Ile Ala Arg Ser Ile Ala Arg Ala Leu
580 585 590

Asn Arg Glu Tyr Phe Arg Phe Ser Val Gly Gly Met Thr Asp Val Ala
595 600 605

Glu Ile Lys Gly His Arg Arg Thr Tyr Val Gly Ala Met Pro Gly Lys
610 615 620

Ile Ile Gln Cys Leu Lys Lys Thr Lys Thr Glu Asn Pro Leu Ile Leu
625 630 635 640

Ile Asp Glu Val Arg Ala Glu Arg Pro Gly Asp Pro Leu Thr Arg Gln
645 650 655

Cys Arg Val Gln Asp Glu Ala Glu Pro Ser Gly Ser
660 665



<210> 187 

<211> 771 

<212> PRT 

<213> Homo sapien 

<400> 187 

Met Glu Glu Arg Lys His Ser Leu Phe Phe Ser Pro Glu Ile Ala Tyr
1 5 10 15

Pro Lys Asp Lys Ala Leu Arg Gly Lys Glu Thr Gly Phe Gly Phe Ser
20 25 30

Phe Asp Ser Leu Gly Pro Val Ser Leu Gly Leu Val Ser Pro Pro Gly
35 40 45

Thr Ala Leu Leu His Leu Pro Gly Arg Pro Trp Pro Gly Pro Val Gly
50 55 60

Gly Gly Glu Gly Val Asp Gly Ala Gly Gly Trp Gly Ser Gln Gly His
65 70 75 80

Ala Pro Thr Gln Pro Thr Pro Cys Trp Ser Thr Ala Gly Pro Gln Arg
85 90 95

Pro Cys Pro Ile Leu Gly Ala Leu Thr Ala Glu Ile Val Lys Thr Ile
100 105 110

Arg Asp Ile Ile Ala Leu Asn Pro Leu Tyr Arg Glu Ser Val Leu Gln
115 120 125

Met Met Gln Ala Gly Gln Arg Val Val Asp Asn Pro Ile Tyr Leu Ser
130 135 140

Asp Met Gly Ala Ala Leu Thr Gly Ala Glu Ser His Glu Leu Gln Asp
145 150 155 160

Val Leu Glu Glu Thr Asn Ile Pro Lys Arg Leu Tyr Lys Ala Leu Ser
165 170 175

Leu Leu Lys Lys Glu Phe Glu Leu Ser Lys Leu Gln Gln Arg Leu Gly
180 185 190

Arg Glu Val Glu Glu Lys Ile Lys Gln Thr His Arg Lys Tyr Leu Leu
195 200 205

Gln Glu Gln Leu Lys Ile Ile Lys Lys Glu Leu Gly Leu Glu Lys Asp
210 215 220

Asp Lys Asp Ala Ile Glu Glu Lys Phe Arg Glu Arg Leu Lys Glu Leu
225 230 235 240

Val Val Pro Lys His Val Met Asp Val Val Asp Glu Glu Leu Ser Lys
245 250 255

Leu Gly Leu Leu Asp Asn His Ser Ser Glu Phe Asn Val Thr Arg Asn
260 265 270

Tyr Leu Asp Trp Leu Thr Ser Ile Pro Trp Gly Lys Tyr Ser Asn Glu
275 280 285

Asn Leu Asp Leu Ala Arg Ala Gln Ala Val Leu Glu Glu Asp His Tyr
290 295 300

Gly Met Glu Asp Val Lys Lys Arg Ile Leu Glu Phe Ile Ala Val Ser
305 310 315 320

Gln Leu Arg Gly Ser Thr Gln Gly Lys Ile Leu Cys Phe Tyr Gly Pro
325 330 335

Pro Gly Val Gly Lys Thr Ser Ile Ala Arg Ser Ile Ala Arg Ala Leu
340 345 350

Asn Arg Glu Tyr Phe Arg Phe Ser Val Gly Gly Met Thr Asp Val Ala
355 360 365

Glu Ile Lys Gly His Arg Arg Thr Tyr Val Gly Ala Met Pro Gly Lys
370 375 380

Ile Ile Gln Cys Leu Lys Lys Thr Lys Thr Glu Asn Pro Leu Ile Leu
385 390 395 400

Ile Asp Glu Val Asp Lys Ile Gly Arg Gly Tyr Gln Gly Asp Pro Ser
405 410 415

Ser Ala Leu Leu Glu Leu Leu Asp Pro Glu Gln Asn Ala Asn Phe Leu
420 425 430

Asp His Tyr Leu Asp Val Pro Val Asp Leu Ser Lys Val Leu Phe Ile
435 440 445



Cys Thr Ala Asn Val Thr Asp Thr Ile Pro Glu Pro Leu Arg Asp Arg
450 455 460

Met Glu Met Ile Asn Val Ser Gly Tyr Val Ala Gln Glu Lys Leu Ala
465 470 475 480

Ile Ala Glu Arg Tyr Leu Val Pro Gln Ala Arg Ala Leu Cys Gly Leu
485 490 495

Asp Glu Ser Lys Ala Lys Leu Ser Ser Asp Val Leu Thr Leu Leu Ile
500 505 510

Lys Gln Tyr Cys Arg Glu Ser Gly Val Arg Asn Leu Gln Lys Gln Val
515 520 525

Glu Lys Val Leu Arg Lys Ser Ala Tyr Lys Ile Val Ser Gly Glu Ala
530 535 540

Glu Ser Val Glu Val Thr Pro Glu Asn Leu Gln Asp Phe Val Gly Lys
545 550 555 560

Pro Val Phe Thr Val Glu Arg Met Tyr Asp Val Thr Pro Pro Gly Val
565 570 575

Val Met Gly Leu Ala Trp Thr Ala Met Gly Gly Ser Thr Leu Phe Val
580 585 590

Glu Thr Ser Leu Arg Arg Pro Gln Asp Lys Asp Ala Lys Gly Asp Lys
595 600 605

Asp Gly Ser Leu Glu Val Thr Gly Gln Leu Gly Glu Val Met Lys Glu
610 615 620

Ser Ala Arg Ile Ala Tyr Thr Phe Ala Arg Ala Phe Leu Met Gln His
625 630 635 640

Ala Pro Ala Asn Asp Tyr Leu Val Thr Ser His Ile His Leu His Val
645 650 655

Pro Glu Gly Ala Thr Pro Lys Asp Gly Pro Ser Ala Gly Cys Thr Ile
660 665 670

Val Thr Ala Leu Leu Ser Leu Ala Met Gly Arg Pro Val Arg Gln Asn
675 680 685

Leu Ala Met Thr Gly Glu Val Ser Leu Thr Gly Lys Ile Leu Pro Val
690 695 700

Gly Gly Ile Lys Glu Lys Thr Ile Ala Ala Lys Arg Ala Gly Val Thr
705 710 715 720

Cys Ile Val Leu Pro Ala Glu Asn Lys Lys Asp Phe Tyr Asp Leu Ala
725 730 735

Ala Phe Ile Thr Glu Gly Leu Glu Val His Phe Val Glu His Tyr Arg
740 745 750

Glu Ile Phe Asp Ile Ala Phe Pro Asp Glu Gln Ala Glu Ala Leu Ala
755 760 765

Val Glu Arg
770



<210> 188 

<211> 848 

<212> PRT 

<213> Homo sapien 

<400> 188 



Met Ala Ala Ser Thr Gly Tyr Val Arg Leu Trp Gly Ala Ala Arg Cys
1 5 10 15

Trp Val Leu Arg Arg Pro Met Leu Ala Ala Ala Gly Gly Arg Val Pro
20 25 30

Thr Ala Ala Gly Ala Trp Leu Leu Arg Gly Gln Arg Thr Cys Asp Ala
35 40 45

Ser Pro Pro Trp Ala Leu Trp Gly Arg Gly Pro Ala Ile Gly Gly Gln
50 55 60

Trp Arg Gly Phe Trp Glu Ala Ser Ser Arg Gly Gly Gly Ala Phe Ser
65 70 75 80

Gly Gly Glu Asp Ala Ser Glu Gly Gly Ala Glu Glu Gly Ala Gly Gly
85 90 95

Ala Gly Gly Ser Ala Gly Ala Gly Glu Gly Pro Val Ile Thr Ala Leu
100 105 110

Thr Pro Met Thr Ile Pro Asp Val Phe Pro His Leu Pro Leu Ile Ala
115 120 125

Ile Thr Arg Asn Pro Val Phe Pro Arg Phe Ile Lys Ile Ile Glu Val
130 135 140

Lys Asn Lys Lys Leu Val Glu Leu Leu Arg Arg Lys Val Arg Leu Ala
145 150 155 160

Gln Pro Tyr Val Gly Val Phe Leu Lys Arg Asp Asp Ser Asn Glu Ser
165 170 175

Asp Val Val Glu Ser Leu Asp Glu Ile Tyr His Thr Gly Thr Phe Ala
180 185 190

Gln Ile His Glu Met Gln Asp Leu Gly Asp Lys Leu Arg Met Ile Val
195 200 205

Met Gly His Arg Arg Val His Ile Ser Arg Gln Leu Glu Val Glu Pro
210 215 220



Glu Glu Pro Glu Ala Glu Asn Lys His Lys Pro Arg Arg Lys Ser Lys
225 230 235 240

Arg Gly Lys Lys Glu Ala Glu Asp Glu Leu Ser Ala Arg His Pro Ala
245 250 255

Glu Leu Ala Met Glu Pro Thr Pro Glu Leu Pro Ala Glu Val Leu Met
260 265 270

Val Glu Val Glu Asn Val Val His Glu Asp Phe Gln Val Thr Glu Glu
275 280 285

Val Lys Ala Leu Thr Ala Glu Ile Val Lys Thr Ile Arg Asp Ile Ile
290 295 300

Ala Leu Asn Pro Leu Tyr Arg Glu Ser Val Leu Gln Met Met Gln Ala
305 310 315 320

Gly Gln Arg Val Val Asp Asn Pro Ile Tyr Leu Ser Asp Met Gly Ala
325 330 335

Ala Leu Thr Gly Ala Glu Ser His Glu Leu Gln Asp Val Leu Glu Glu
340 345 350

Thr Asn Ile Pro Lys Arg Leu Tyr Lys Ala Leu Ser Leu Leu Lys Lys
355 360 365

Glu Phe Glu Leu Ser Lys Leu Gln Gln Arg Leu Gly Arg Glu Val Glu
370 375 380

Glu Lys Ile Lys Gln Thr His Arg Lys Tyr Leu Leu Gln Glu Gln Leu
385 390 395 400

Lys Ile Ile Lys Lys Glu Leu Gly Leu Glu Lys Asp Asp Lys Asp Ala
405 410 415

Ile Glu Glu Lys Phe Arg Glu Arg Leu Lys Glu Leu Val Val Pro Lys
420 425 430

His Val Met Asp Val Val Asp Glu Glu Leu Ser Lys Leu Gly Leu Leu
435 440 445




Asp Asn His Ser Ser Glu Phe Asn Val Thr Arg Asn Tyr Leu Asp Trp
450 455 460

Leu Thr Ser Ile Pro Trp Gly Lys Tyr Ser Asn Glu Asn Leu Asp Leu
465 470 475 480

Ala Arg Ala Gln Ala Val Leu Glu Glu Asp His Tyr Gly Met Glu Asp
485 490 495

Val Lys Lys Arg Ile Leu Glu Phe Ile Ala Val Ser Gln Leu Arg Gly
500 505 510

Ser Thr Gln Gly Lys Ile Leu Cys Phe Tyr Gly Pro Pro Gly Val Gly
515 520 525

Lys Thr Ser Ile Ala Arg Ser Ile Ala Arg Ala Leu Asn Arg Glu Tyr
530 535 540

Phe Arg Phe Ser Val Gly Gly Met Thr Asp Val Ala Glu Ile Lys Gly
545 550 555 560

His Arg Arg Thr Tyr Val Gly Ala Met Pro Gly Lys Ile Ile Gln Cys
565 570 575

Leu Lys Lys Thr Lys Thr Glu Asn Pro Leu Ile Leu Ile Asp Glu Val
580 585 590

Asp Lys Ile Gly Arg Gly Tyr Gln Gly Asp Pro Ser Ser Ala Leu Leu
595 600 605

Glu Leu Leu Asp Pro Glu Gln Asn Ala Asn Phe Leu Asp His Tyr Leu
610 615 620

Asp Val Pro Val Asp Leu Ser Lys Val Leu Phe Ile Cys Thr Ala Asn
625 630 635 640

Val Thr Asp Thr Ile Pro Glu Pro Leu Arg Asp Arg Met Glu Met Ile
645 650 655

Asn Val Ser Gly Tyr Val Ala Gln Glu Lys Leu Ala Ile Ala Glu Arg
660 665 670

Tyr Leu Val Pro Gln Ala Arg Ala Leu Cys Gly Leu Asp Glu Ser Lys
675 680 685

Ala Lys Leu Ser Ser Asp Val Leu Thr Leu Leu Ile Lys Gln Tyr Cys
690 695 700

Arg Glu Ser Gly Val Arg Asn Leu Gln Lys Gln Val Glu Lys Val Leu
705 710 715 720

Arg Lys Ser Ala Tyr Lys Ile Val Ser Gly Glu Ala Glu Ser Val Glu
725 730 735

Val Thr Pro Glu Asn Leu Gln Asp Phe Val Gly Lys Pro Val Phe Thr
740 745 750

Val Glu Arg Met Tyr Asp Val Thr Pro Pro Gly Val Val Met Gly Leu
755 760 765

Ala Trp Thr Ala Met Gly Glu Arg Gly Gly Gly Arg Arg Pro Gln Ser
770 775 780

His Ser His Phe Tyr Pro Arg Thr Ser Arg Ser His Leu Val His Leu
785 790 795 800

Cys Ser Gly Pro Gln Val Ala Leu Asn Gly Leu Trp Arg Gly Val Val
805 810 815

Cys Trp Glu Pro Arg Gly Ser Gly Leu Arg Glu Gly Lys Ala Val Thr
820 825 830

His Gly Gly Gly Ser Thr Pro Trp Ala Leu Trp Ile Trp Val Pro Leu
835 840 845



<210> 189 

<211> 124 

<212> PRT 

<213> Homo sapien 

<400> 189 

Met Val Phe Leu His Val Gly Gln Ala Gly Leu Glu Leu Pro Thr Ser
1 5 10 15

Gly Asp Pro Pro Thr Ser Ala Ser Gln Ser Ala Gly Met Thr Glu Leu
20 25 30

Glu Leu Gly Pro Ser Pro Arg Leu Gln Pro Ile Arg Arg His Pro Lys
35 40 45

Glu Leu Pro Gln Tyr Gly Gly Pro Gly Lys Asp Ile Phe Glu Asp Gln
50 55 60

Leu Tyr Leu Pro Val His Ser Asp Gly Ile Ser Val His Gln Met Phe
65 70 75 80

Thr Met Ala Thr Ala Glu His Arg Ser Asn Ser Ser Ile Ala Gly Lys
85 90 95

Met Leu Thr Lys Val Glu Lys Asn His Glu Lys Glu Lys Ser Gln His
100 105 110

Leu Glu Gly Ser Ala Ser Ser Ser Leu Ser Ser Asp
115 120



<210> 190 

<211> 296 

<212> PRT 

<213> Homo sapien 

<400> 190 

Met Ser Thr Glu Arg Thr Ser Trp Thr Ser Leu Ser Thr Ile Gln Lys
1 5 10 15

Ile Ala Leu Gly Leu Gly Ile Pro Ala Ser Ala Thr Val Ala Tyr Ile
20 25 30

Leu Tyr Arg Arg Tyr Arg Glu Ser Arg Glu Glu Arg Leu Thr Phe Val
35 40 45

Gly Glu Asp Asp Ile Glu Ile Glu Met Arg Val Pro Gln Glu Ala Val
50 55 60

Lys Leu Ile Ile Gly Arg Gln Gly Ala Asn Ile Lys Gln Leu Arg Lys
65 70 75 80

Gln Thr Gly Ala Arg Ile Asp Val Asp Thr Glu Asp Val Gly Asp Glu
85 90 95

Arg Val Leu Leu Ile Ser Gly Phe Pro Val Gln Val Cys Lys Ala Lys
100 105 110

Ala Ala Ile His Gln Ile Leu Thr Glu Asn Thr Pro Val Ser Glu Gln
115 120 125

Leu Ser Val Pro Gln Arg Ser Val Gly Arg Ile Ile Gly Arg Gly Gly
130 135 140

Glu Thr Ile Arg Ser Ile Cys Lys Ala Ser Gly Ala Lys Ile Thr Cys
145 150 155 160

Asp Lys Glu Ser Glu Gly Thr Leu Leu Leu Ser Arg Leu Ile Lys Ile
165 170 175

Ser Gly Thr Gln Lys Glu Val Ala Ala Ala Lys His Leu Ile Leu Glu
180 185 190

Lys Val Ser Glu Asp Glu Glu Leu Arg Lys Arg Ile Ala His Ser Ala
195 200 205

Glu Thr Arg Val Pro Arg Lys Gln Pro Ile Ser Val Arg Arg Glu Asp
210 215 220

Met Thr Glu Pro Gly Gly Ala Gly Glu Pro Ala Leu Trp Lys Asn Thr
225 230 235 240

Ser Ser Ser Met Glu Pro Thr Ala Pro Leu Val Thr Pro Pro Pro Lys
245 250 255

Gly Gly Gly Asp Met Ala Val Val Val Ser Lys Glu Gly Ser Trp Glu
260 265 270

Lys Pro Ser Asp Asp Ser Phe Gln Lys Ser Glu Ala Gln Ala Ile Pro
275 280 285

Glu Met Pro Met Phe Glu Ser Met
290 295



<210> 191 

<211> 195 

<212> PRT 

<213> Homo sapien 

<400> 191 

Ala Arg Tyr Glu Ala Trp Gly Glu Ser Ala Glu Ala His Val Leu Glu
1 5 10 15

Gly Pro Asp Thr Asn Thr Thr Ile Ile Gln Leu Gln Pro Leu Gln Glu
20 25 30

Pro Glu Ser Trp Ala Arg Thr Gln Ser Gly Leu Gln Ser Tyr Leu Leu
35 40 45

Gln Phe His Gly Leu Val Arg Leu Val His Gln Glu Arg Thr Leu Ala
50 55 60

Phe Pro Leu Thr Ile Arg Cys Phe Leu Gly Cys Glu Leu Pro Pro Glu
65 70 75 80

Gly Ser Arg Ala His Val Phe Phe Glu Val Ala Val Asn Gly Ser Ser
85 90 95

Phe Val Ser Phe Arg Pro Glu Arg Ala Leu Trp Gln Ala Asp Thr Gln
100 105 110

Val Thr Ser Gly Val Val Thr Phe Thr Leu Gln Gln Leu Asn Ala Tyr
115 120 125

Asn Arg Thr Arg Tyr Glu Leu Arg Glu Phe Leu Glu Asp Thr Cys Val
130 135 140

Gln Tyr Val Gln Lys His Ile Ser Ala Glu Asn Thr Lys Gly Ser Gln
145 150 155 160

Thr Ser Arg Ser Tyr Thr Ser Leu Val Leu Gly Val Leu Val Gly Ser
165 170 175

Phe Ile Ile Ala Gly Val Ala Val Gly Ile Phe Leu Cys Thr Gly Gly
180 185 190

Arg Arg Cys
195



<210> 192 

<211> 194 

<212> PRT 

<213> Homo sapien 

<220> 

<221> MISC_FEATURE

<222> (6)..(7)

<223> X=any amino acid 



<400> 192 

Leu Gly Thr Gly Arg Xaa Xaa Ser Ala Glu Ala His Val Leu Glu Gly
1 5 10 15

Pro Asp Thr Asn Thr Thr Ile Ile Gln Leu Gln Pro Leu Gln Glu Pro
20 25 30

Glu Ser Trp Ala Arg Thr Gln Ser Gly Leu Gln Ser Tyr Leu Leu Gln
35 40 45

Phe His Gly Leu Val Arg Leu Val His Gln Glu Arg Thr Leu Ala Phe
50 55 60

Pro Leu Thr Ile Arg Cys Phe Leu Gly Cys Glu Leu Pro Pro Glu Gly
65 70 75 80

Ser Arg Ala His Val Phe Phe Glu Val Ala Val Asn Gly Ser Ser Phe
85 90 95

Val Ser Phe Arg Pro Glu Arg Ala Leu Trp Gln Ala Asp Thr Gln Val
100 105 110

Thr Ser Gly Val Val Thr Phe Thr Leu Gln Gln Leu Asn Ala Tyr Asn
115 120 125

Arg Thr Arg Tyr Glu Leu Arg Glu Phe Leu Glu Asp Thr Cys Val Gln
130 135 140

Tyr Val Gln Lys His Ile Ser Ala Glu Asn Thr Lys Gly Ser Gln Thr
145 150 155 160

Ser Arg Ser Tyr Thr Ser Leu Val Leu Gly Val Leu Val Gly Ser Phe
165 170 175

Ile Ile Ala Gly Val Ala Val Gly Ile Phe Leu Cys Thr Gly Gly Arg
180 185 190

Arg Cys



<210> 193 

<211> 132 

<212> PRT 

<213> Homo sapien 

<400> 193 

Met Asn Tyr Thr Gln Arg Glu Leu Gln Met Ala Ala Pro Thr His Leu
1 5 10 15

Asn His Thr Val Val Gly Thr Pro Cys Gly Asn Gln Thr Leu Ala Thr
20 25 30

Thr Arg Arg Lys His Leu Ala Trp Arg Glu Arg Arg Pro Ala His Thr
35 40 45

Thr Pro Ala Arg Ala Arg Asp Gly Asn Pro Asn Ile Gly Val Gly Ala
50 55 60

Ala Asp Lys Pro Pro Ser Leu Leu Asn His Ala Arg Arg Ser Ser Leu
65 70 75 80

Pro Asn Arg Pro Pro Arg Ser Thr Gly Gly Asp Glu Ser Leu Ile Thr
85 90 95

His Asn Pro Ser Tyr Ser His Gly Arg Arg Ala Ile Leu His Ala Cys
100 105 110

Val Val Pro His His Thr Glu Arg Arg Val Ala Ser Ile Ile Cys Arg
115 120 125

Pro Gly Pro Arg
130



<210> 194 

<211> 199 

<212> PRT 

<213> Homo sapien 

<400> 194 

Ile Ile Thr Ile Lys Leu Phe Lys Lys Lys Lys Lys Lys Gln Thr Lys
1 5 10 15

Asn Ile Lys Thr Lys Lys Gln Lys Glu Lys Lys Lys Lys Gln Asn Arg
20 25 30

Gly Ala Gly Ala Pro Gln Lys Lys Ser Arg Pro Arg Gly Ala Ala Pro
35 40 45

Ile Thr Ala Arg Pro Pro Gly Phe Leu Val Ser Thr Gln Gln Gly Gly
50 55 60

Ala Pro His Lys Asn Glu Arg Ala His Arg Ser His Asn Tyr Thr Gln
65 70 75 80

Arg Glu Leu Arg Trp Pro Arg Pro His Thr Ser Thr Thr Gln Ser Trp
85 90 95

Val His Pro Ala Ala Thr Lys Arg Trp Pro Gln Pro Gly Gly Asn Ile
100 105 110

Leu Arg Gly Glu Lys Gly Asp Pro Leu Thr Leu Pro Gln Arg Glu Arg
115 120 125

Gly Thr Ala Thr Gln Thr Ser Val Trp Gly Pro Pro Thr Asn Leu Pro
130 135 140

His Ser Ser Thr Met Arg Asp Asp Leu His Tyr Gln Ile Ala Pro Leu
145 150 155 160

Ala Arg Arg Ala Ala Thr Asn Arg Ser Ser His Thr Ile His Leu Ile
165 170 175

His Thr Ala Val Ala Leu Ser Ser Thr Arg Val Trp Cys Leu Ile Thr
180 185 190

Pro Asn Asp Gly Ser His Gln
195



<210> 195 

<211> 259 

<212> PRT 

<213> Homo sapien 

<400> 195 

Met Glu Gly Arg Arg Gln Ala Arg Arg Leu Arg Gln Leu Ala Gly Ala
1 5 10 15

Gly Ala Gln Gly Gly Ser Pro Ile Val Glu Ala Ala Glu Asn Cys His
20 25 30

Gly Ala Ala Ser Val Gln Arg Ala Ser Leu Ile Tyr Ser Gly Arg Val
35 40 45

Cys Ile Trp Gly Gly Val Gln Gly Ala Asp Lys Asp Arg Arg Ala Pro
50 55 60

Gly Ala Ala Ala Gly Gly Ala Lys Thr Gly Thr Arg Met Ser Ala Pro
65 70 75 80

Gln Arg Pro Trp Ala Leu Ala Ala Gly Ala Arg Arg Thr Pro Arg Glu
85 90 95

Ala Gly Ile Ser Lys Ala Val Arg Pro Glu Ala Arg Pro Arg Leu Arg
100 105 110

Thr Lys Thr Glu Leu Glu Asp Leu Gln Lys Lys Pro Pro Pro Tyr Leu
115 120 125

Arg Asn Leu Ser Ser Asp Asp Ala Asn Val Leu Val Trp His Ala Leu
130 135 140

Leu Leu Pro Asp Gln Pro Pro Tyr His Leu Lys Ala Phe Asn Leu Arg
145 150 155 160

Ile Ser Phe Pro Pro Glu Tyr Pro Phe Lys Pro Pro Met Ile Lys Phe
165 170 175

Thr Thr Lys Ile Tyr His Pro Asn Val Asp Glu Asn Gly Gln Ile Cys
180 185 190

Leu Pro Ile Ile Ser Ser Glu Asn Trp Lys Pro Cys Thr Lys Thr Cys
195 200 205

Gln Val Leu Glu Ala Leu Asn Val Leu Val Asn Arg Pro Asn Ile Arg
210 215 220

Glu Pro Leu Arg Met Asp Leu Ala Asp Leu Leu Thr Gln Asn Pro Glu
225 230 235 240

Leu Phe Arg Lys Asn Ala Glu Glu Phe Thr Leu Arg Phe Gly Val Asp
245 250 255

Arg Pro Ser



<210> 196 

<211> 156 

<212> PRT 

<213> Homo sapien 

<400> 196 

Met Arg Arg Trp Thr Trp Lys Ala Ser Trp Ser Ser Glu Leu Glu Asp
1 5 10 15

Leu Gln Lys Lys Pro Pro Pro Tyr Leu Arg Asn Leu Ser Ser Asp Asp
20 25 30

Ala Asn Val Leu Val Trp His Ala Leu Leu Leu Pro Asp Gln Pro Pro
35 40 45

Tyr His Leu Lys Ala Phe Asn Leu Arg Ile Ser Phe Pro Pro Glu Tyr
50 55 60

Pro Phe Lys Pro Pro Met Ile Lys Phe Thr Thr Lys Ile Tyr His Pro
65 70 75 80

Asn Val Asp Glu Asn Gly Gln Ile Cys Leu Pro Ile Ile Ser Ser Glu
85 90 95

Asn Trp Lys Pro Cys Thr Lys Thr Cys Gln Val Leu Glu Ala Leu Asn
100 105 110

Val Leu Val Asn Arg Pro Asn Ile Arg Glu Pro Leu Arg Met Asp Leu
115 120 125

Ala Asp Leu Leu Thr Gln Asn Pro Glu Leu Phe Arg Lys Asn Ala Glu
130 135 140

Glu Phe Thr Leu Arg Phe Gly Val Asp Arg Pro Ser
145 150 155



<210> 197 

<211> 168 

<212> PRT 

<213> Homo sapien 

<400> 197 

Asn Leu Gly Gln Ser Ser Leu Ser Ile Leu Trp Arg Lys Arg Cys Gly
1 5 10 15

Gly Gly Pro Gly Arg Pro Pro Gly Glu Leu Glu Asp Leu Gln Lys Lys
20 25 30

Pro Pro Pro Tyr Leu Arg Asn Leu Ser Ser Asp Asp Ala Asn Val Leu
35 40 45

Val Trp His Ala Leu Leu Leu Pro Asp Gln Pro Pro Tyr His Leu Lys
50 55 60

Ala Phe Asn Leu Arg Ile Ser Phe Pro Pro Glu Tyr Pro Phe Lys Pro
65 70 75 80

Pro Met Ile Lys Phe Thr Thr Lys Ile Tyr His Pro Asn Val Asp Glu
85 90 95

Asn Gly Gln Ile Cys Leu Pro Ile Ile Ser Ser Glu Asn Trp Lys Pro
100 105 110

Cys Thr Lys Thr Cys Gln Val Leu Glu Ala Leu Asn Val Leu Val Asn
115 120 125

Arg Pro Asn Ile Arg Glu Pro Leu Arg Met Asp Leu Ala Asp Leu Leu
130 135 140

Thr Gln Asn Pro Glu Leu Phe Arg Lys Asn Ala Glu Glu Phe Thr Leu
145 150 155 160

Arg Phe Gly Val Asp Arg Pro Ser
165



<210> 198 

<211> 137 

<212> PRT 

<213> Homo sapien 

<400> 198 

Met Glu Gly Arg Arg Gln Gly Asn Leu Ser Ser Asp Asp Ala Asn Val
1 5 10 15

Leu Val Trp His Ala Leu Leu Leu Pro Asp Gln Pro Pro Tyr His Leu
20 25 30

Lys Ala Phe Asn Leu Arg Ile Ser Phe Pro Pro Glu Tyr Pro Phe Lys
35 40 45

Pro Pro Met Ile Lys Phe Thr Thr Lys Ile Tyr His Pro Asn Val Asp
50 55 60

Glu Asn Gly Gln Ile Cys Leu Pro Ile Ile Ser Ser Glu Asn Trp Lys
65 70 75 80

Pro Cys Thr Lys Thr Cys Gln Val Leu Glu Ala Leu Asn Val Leu Val
85 90 95

Asn Arg Pro Asn Ile Arg Glu Pro Leu Arg Met Asp Leu Ala Asp Leu
100 105 110

Leu Thr Gln Asn Pro Glu Leu Phe Arg Lys Asn Ala Glu Glu Phe Thr
115 120 125

Leu Arg Phe Gly Val Asp Arg Pro Ser
130 135



<210> 199 

<211> 237 

<212> PRT 

<213> Homo sapien 



<400> 199

Met Met Ala Ser Met Arg Val Val Lys Glu Leu Glu Asp Leu Gln Lys
1 5 10 15

Lys Pro Pro Pro Tyr Leu Arg Asn Leu Ser Ser Asp Asp Ala Asn Val
20 25 30

Leu Val Trp His Ala Leu Leu Leu Pro Glu Ala Glu Val Ala Val Ser
35 40 45

Arg Asp His Ala Ile Ala Leu Gln Pro Gly Gln Gln Ser Glu Thr Pro
50 55 60

Ser Gln Lys Lys Lys Lys Lys Glu Ala Trp His Gln His Leu Leu Leu
65 70 75 80

Met Arg Pro Ser Gly Ser Phe His Ser Trp Trp Lys Ala Lys Gly Ser
85 90 95

His Val Tyr Arg Ser His Ala Arg Glu Glu Val Lys Glu Arg Glu Ser
100 105 110

Glu Gln Val Pro Gly Ser Ser Lys Gln Pro Ala Phe Ser Asp Gln Pro
115 120 125

Pro Tyr His Leu Lys Ala Phe Asn Leu Arg Ile Ser Phe Pro Pro Glu
130 135 140

Tyr Pro Phe Lys Pro Pro Met Ile Lys Phe Thr Thr Lys Ile Tyr His
145 150 155 160

Pro Asn Val Asp Glu Asn Gly Gln Ile Cys Leu Pro Ile Ile Ser Ser
165 170 175

Glu Asn Trp Lys Pro Cys Thr Lys Thr Cys Gln Val Leu Glu Ala Leu
180 185 190

Asn Val Leu Val Asn Arg Pro Asn Ile Arg Glu Pro Leu Arg Met Asp
195 200 205

Leu Ala Asp Leu Leu Thr Gln Asn Pro Glu Leu Phe Arg Lys Asn Ala
210 215 220

Glu Glu Phe Thr Leu Arg Phe Gly Val Asp Arg Pro Ser
225 230 235

<210> 200
<211> 156 
<212> PRT 
<213> Homo sapien 

<400> 200 

Gly Pro Gln Glu Ala Ser Thr His Gly Gly Arg Gln Arg Gly Ala Thr
1 5 10 15

Cys Thr Asp His Ile Ala Arg Glu Glu Val Lys Glu Arg Glu Ser Glu
20 25 30

Gln Val Pro Gly Ser Ser Lys Gln Pro Ala Phe Ser Asp Gln Pro Pro
35 40 45

Tyr His Leu Lys Ala Phe Asn Leu Arg Ile Ser Phe Pro Pro Glu Tyr
50 55 60

Pro Phe Lys Pro Pro Met Ile Lys Phe Thr Thr Lys Ile Tyr His Pro
65 70 75 80

Asn Val Asp Glu Asn Gly Gln Ile Cys Leu Pro Ile Ile Ser Ser Glu
85 90 95

Asn Trp Lys Pro Cys Thr Lys Thr Cys Gln Val Leu Glu Ala Leu Asn
100 105 110

Val Leu Val Asn Arg Pro Asn Ile Arg Glu Pro Leu Arg Met Asp Leu
115 120 125

Ala Asp Leu Leu Thr Gln Asn Pro Glu Leu Phe Arg Lys Asn Ala Glu
130 135 140



Glu Phe Thr Leu Arg Phe Gly Val Asp Arg Pro Ser 
145 150 155 



<210> 201 

<211> 88 

<212> PRT 

<213> Homo sapien 

<400> 201 

Met Val Gln Ala Gly Pro Ser Ser Cys Ser Ile Ser Gly Asp Pro Gly
1 5 10 15

Leu Pro Arg Arg Trp Arg Pro Ala Gln Val Val Arg Pro Gly Arg Leu
20 25 30

Arg Ile Arg Gly Trp Ser Arg Arg Ile Pro Lys Ala Glu Val Gly Ser
35 40 45

Pro Gly Asp Ser Gln Leu Leu Ser Leu Trp Arg Arg Gly Pro Val Thr
50 55 60

Glu Ala Pro Phe Ser Asn Pro Gly Ala Ala Phe Gln Arg Leu Asn Phe
65 70 75 80



Ser Asn His Cys Phe Asn Ser Phe 
85 



<210> 202 

<211> 40 

<212> PRT 

<213> Homo sapien 

<400> 202 

Met Glu Lys Gly Val Gly Gly Gln Pro Arg Gly Arg Arg Arg Ile Tyr
1 5 10 15

Asn Ile Phe Phe Arg His Arg Cys Tyr Arg Lys Met Cys Glu Arg Ser
20 25 30



Gly Cys Ala Ala Arg Thr Gly Ala 
35 40 



<210> 203 

<211> 60 

<212> PRT 

<213> Homo sapien 

<400> 203 

Gly Pro Glu Lys Trp Arg Arg Gly Trp Gly Asp Ser His Val Ala Ala
1 5 10 15

Gly Gly Phe Thr Thr Phe Ser Phe Ala Ile Asp Val Ile Ala Lys Cys
20 25 30

Val Arg Glu Ala Ala Ala Gln Pro Gly Arg Glu Arg Glu Gly Ala Gly
35 40 45

Gln Arg Phe Pro Pro Thr Gly Asn Leu Met Gly Leu
50 55 60



<210> 204

<211> 213

<212> PRT

<213> Homo sapien

<220>

<221> MISC_FEATURE

<222> (5)..(5)

<223> X=any amino acid



<400> 204 



Met Pro Gln Asn Xaa Gly Val Ile Gly Leu Arg His His Phe Ala Ile
1 5 10 15

His Tyr Pro Ala Gly Gly Gly Leu Trp Asp Gly Leu His Gly Val Ala
20 25 30

Ala Val Gln Gly Ile Thr Lys Ile Lys Val Leu Ala Ile Tyr Ser Phe
35 40 45

Cys Ser Gln Ile Cys Asp Pro Arg Thr Thr Gly Ala Phe Trp Gln Thr
50 55 60

Trp Lys Asp Phe Glu Val Arg His Gly Asn Glu Asp Thr Ile Lys Glu
65 70 75 80

Met Leu Arg Ile Arg Arg Ser Val Gln Ala Thr Tyr Asn Thr Gln Val
85 90 95

Asn Phe Met Ala Ser Gln Met Leu Lys Val Ser Gly Ser Ala Thr Gly
100 105 110

Thr Val Ser Asp Leu Ala Pro Gly Gln Ser Gly Met Asp Asp Met Lys
115 120 125

Leu Leu Glu Gln Arg Ala Glu Gln Leu Ala Ala Glu Ala Glu Arg Asp
130 135 140

Gln Pro Leu Arg Ala Gln Ser Lys Ile Leu Phe Val Arg Ser Asp Ala
145 150 155 160

Ser Arg Glu Glu Leu Ala Glu Leu Ala Gln Gln Val Asn Pro Glu Glu
165 170 175

Ile Gln Leu Gly Glu Asp Glu Asp Glu Asp Glu Met Asp Leu Glu Pro
180 185 190

Asn Glu Val Arg Leu Glu Gln Gln Ser Val Pro Ala Ala Val Phe Gly
195 200 205

Ser Leu Lys Glu Asp
210



<210> 205 

<211> 458 

<212> PRT 

<213> Homo sapien 



<400> 205 

Met Ile Asp His Tyr Arg Gly Met Gly Pro Leu Met Leu Leu Glu Arg
1 5 10 15

Arg Ser Val Met Asp Arg Gly Arg Gly Arg Tyr Gln Tyr Ser Pro Gln
20 25 30

Asn Gln His Val Glu Gln Gln Pro His Tyr Thr His Lys Pro Thr Leu
35 40 45

Glu Tyr Ser Pro Phe Pro Ile Pro Pro Gln Ser Pro Ala Tyr Glu Pro
50 55 60

Asn Leu Phe Asp Gly Pro Glu Ser Gln Phe Cys Pro Asn Gln Ser Leu
65 70 75 80

Val Ser Leu Leu Gly Asp Gln Arg Glu Ser Glu Asn Ile Ala Asn Pro
85 90 95

Met Gln Thr Ser Ser Ser Val Gln Gln Gln Asn Asp Ala His Leu His
100 105 110

Ser Phe Ser Met Met Pro Ser Ser Ala Cys Glu Ala Met Val Gly His
115 120 125

Glu Met Ala Ser Asp Ser Ser Asn Thr Ser Leu Pro Phe Ser Asn Met
130 135 140

Gly Asn Pro Met Asn Thr Thr Gln Leu Gly Lys Ser Leu Phe Gln Trp
145 150 155 160

Gln Val Glu Gln Glu Glu Ser Lys Leu Ala Asn Ile Ser Gln Asp Gln
165 170 175

Phe Leu Ser Lys Asp Ala Asp Gly Asp Thr Phe Leu His Ile Ala Val
180 185 190

Ala Gln Gly Arg Arg Ala Leu Ser Tyr Val Leu Ala Arg Lys Met Asn
195 200 205

Ala Leu His Met Leu Asp Ile Lys Glu His Asn Gly Gln Ser Ala Phe
210 215 220

Gln Val Ala Val Ala Ala Asn Gln His Leu Ile Val Gln Asp Leu Val
225 230 235 240

Asn Ile Gly Ala Gln Val Asn Thr Thr Asp Cys Trp Gly Arg Thr Pro
245 250 255

Leu His Val Cys Ala Glu Lys Gly His Ser Gln Val Leu Gln Ala Ile
260 265 270

Gln Lys Gly Ala Val Gly Ser Asn Gln Phe Val Asp Leu Glu Ala Thr
275 280 285

Asn Tyr Asp Gly Leu Thr Pro Leu His Cys Ala Val Ile Ala His Asn
290 295 300

Ala Val Val His Glu Leu Gln Arg Asn Gln Gln Pro His Ser Pro Glu
305 310 315 320

Val Gln Glu Leu Leu Leu Lys Asn Lys Ser Leu Val Asp Thr Ile Lys
325 330 335

Cys Leu Ile Gln Met Gly Ala Ala Val Glu Ala Lys Asp Arg Lys Ser
340 345 350

Gly Arg Thr Ala Leu His Leu Ala Ala Glu Glu Ala Asn Leu Glu Leu
355 360 365

Ile Arg Leu Phe Leu Glu Leu Pro Ser Cys Leu Ser Phe Val Asn Ala
370 375 380

Lys Ala Tyr Asn Gly Asn Thr Ala Leu His Val Ala Ala Ser Leu Gln
385 390 395 400

Tyr Arg Leu Thr Gln Leu Asp Ala Val Arg Leu Leu Met Arg Lys Gly
405 410 415

Ala Asp Pro Ser Thr Arg Asn Leu Glu Asn Glu Gln Pro Val His Leu
420 425 430

Val Pro Asp Gly Pro Val Gly Glu Gln Ile Arg Arg Ile Leu Lys Gly
435 440 445

Lys Ser Ile Gln Gln Arg Ala Pro Pro Tyr
450 455



<210> 206 

<211> 439 

<212> PRT 

<213> Homo sapien 

<400> 206 

Trp Ile Val Val Ala Ala Arg Tyr Gln Tyr Ser Pro Gln Asn Gln His
1 5 10 15

Val Glu Gln Gln Pro His Tyr Thr His Lys Pro Thr Leu Glu Tyr Ser
20 25 30

Pro Phe Pro Ile Pro Pro Gln Ser Pro Ala Tyr Glu Pro Asn Leu Phe
35 40 45

Asp Gly Pro Glu Ser Gln Phe Cys Pro Asn Gln Ser Leu Val Ser Leu
50 55 60

Leu Gly Asp Gln Arg Glu Ser Glu Asn Ile Ala Asn Pro Met Gln Thr
65 70 75 80

Ser Ser Ser Val Gln Gln Gln Asn Asp Ala His Leu His Ser Phe Ser
85 90 95

Met Met Pro Ser Ser Ala Cys Glu Ala Met Val Gly His Glu Met Ala
100 105 110

Ser Asp Ser Ser Asn Thr Ser Leu Pro Phe Ser Asn Met Gly Asn Pro
115 120 125

Met Asn Thr Thr Gln Leu Gly Lys Ser Leu Phe Gln Trp Gln Val Glu
130 135 140

Gln Glu Glu Ser Lys Leu Ala Asn Ile Ser Gln Asp Gln Phe Leu Ser
145 150 155 160

Lys Asp Ala Asp Gly Asp Thr Phe Leu His Ile Ala Val Ala Gln Gly
165 170 175



Arg Arg Ala Leu Ser Tyr Val Leu Ala Arg Lys Met Asn Ala Leu His 
180 185 190 




Met Leu Asp Ile Lys Glu His Asn Gly Gln Ser Ala Phe Gln Val Ala
195 200 205

Val Ala Ala Asn Gln His Leu Ile Val Gln Asp Leu Val Asn Ile Gly
210 215 220

Ala Gln Val Asn Thr Thr Asp Cys Trp Gly Arg Thr Pro Leu His Val
225 230 235 240

Cys Ala Glu Lys Gly His Ser Gln Val Leu Gln Ala Ile Gln Lys Gly
245 250 255

Ala Val Gly Ser Asn Gln Phe Val Asp Leu Glu Ala Thr Asn Tyr Asp
260 265 270

Gly Leu Thr Pro Leu His Cys Ala Val Ile Ala His Asn Ala Val Val
275 280 285

His Glu Leu Gln Arg Asn Gln Gln Pro His Ser Pro Glu Val Gln Glu
290 295 300

Leu Leu Leu Lys Asn Lys Ser Leu Val Asp Thr Ile Lys Cys Leu Ile
305 310 315 320

Gln Met Gly Ala Ala Val Glu Ala Lys Asp Arg Lys Ser Gly Arg Thr
325 330 335

Ala Leu His Leu Ala Ala Glu Glu Ala Asn Leu Glu Leu Ile Arg Leu
340 345 350

Phe Leu Glu Leu Pro Ser Cys Leu Ser Phe Val Asn Ala Lys Ala Tyr
355 360 365

Asn Gly Asn Thr Ala Leu His Val Ala Ala Ser Leu Gln Tyr Arg Leu
370 375 380

Thr Gln Leu Asp Ala Val Arg Leu Leu Met Arg Lys Gly Ala Asp Pro
385 390 395 400

Ser Thr Arg Asn Leu Glu Asn Glu Gln Pro Val His Leu Val Pro Asp
405 410 415

Gly Pro Val Gly Glu Gln Ile Arg Arg Ile Leu Lys Gly Lys Ser Ile
420 425 430

Gln Gln Arg Ala Pro Pro Tyr
435



<210> 207 

<211> 130 

<212> PRT 

<213> Homo sapien 

<400> 207 

Met Gln Pro Leu Trp Leu Cys Trp Ala Leu Trp Val Leu Pro Leu Ala
1 5 10 15

Ser Pro Gly Ala Ala Leu Thr Gly Glu Gln Leu Leu Gly Ser Leu Leu
20 25 30

Arg Gln Leu Gln Leu Lys Glu Val Pro Thr Leu Asp Arg Ala Asp Met
35 40 45

Glu Glu Leu Val Ile Pro Thr His Val Arg Ala Gln Tyr Val Ala Leu
50 55 60

Leu Gln Arg Ser His Gly Asp Arg Ser Arg Gly Lys Arg Phe Ser Gln
65 70 75 80

Ser Phe Arg Glu Val Ala Gly Arg Phe Leu Ala Leu Glu Ala Ser Thr
85 90 95

His Leu Leu Val Phe Gly Met Glu Gln Arg Leu Pro Pro Asn Ser Glu
100 105 110

Leu Val Gln Ala Val Leu Arg Leu Phe Gln Glu Cys Thr Leu Thr Cys
115 120 125



Arg Gly 
130 



<210> 208 

<211> 243 

<212> PRT 

<213> Homo sapien 

<400> 208 

Asp Pro Pro Ala Ala Phe Ser Arg Asp Ser Pro Thr Leu Pro Leu Ala 
1 5 10 15 



Pro Pro Gly Gln His His Ala Ala Pro Val Ala Leu Leu Gly Thr Leu
20 25 30

Gly Val Ala Pro Gly Gln Pro Arg Gly Arg Pro Asp Arg Gly Ala Ala
35 40 45

Pro Gly Gln Pro Ala Ala Ala Ala Ala Ala Gln Arg Gly Ala His Pro
50 55 60

Gly Gln Gly Arg His Gly Gly Ala Gly His Pro His Pro Arg Glu Gly
65 70 75 80

Pro Val Arg Gly Pro Ala Ala Ala Gln Pro Arg Gly Pro Leu Pro Arg
85 90 95

Lys Glu Val Gln Pro Glu Leu Pro Arg Gly Gly Arg Gln Val Pro Gly
100 105 110

Val Gly Gly Gln His Thr Pro Ala Gly Val Arg His Gly Ala Ala Ala
115 120 125

Ala Ala Gln Gln Arg Ala Gly Ala Gly Arg Ala Ala Ala Leu Pro Gly
130 135 140

Met Tyr Ile Asp Leu Gln Gly Met Lys Trp Ala Glu Asn Trp Val Leu
145 150 155 160

Glu Pro Pro Gly Phe Leu Ala Tyr Glu Cys Val Gly Thr Cys Arg Gln
165 170 175

Pro Pro Glu Ala Leu Ala Phe Lys Trp Pro Phe Leu Gly Pro Arg Gln
180 185 190

Cys Ile Ala Ser Glu Thr Asp Ser Leu Pro Met Ile Val Ser Ile Lys
195 200 205

Glu Gly Gly Arg Thr Arg Pro Gln Val Val Ser Leu Pro Asn Met Arg
210 215 220

Val Gln Lys Cys Ser Cys Ala Ser Asp Gly Ala Leu Val Pro Arg Arg
225 230 235 240

Leu Gln Pro



<210> 209 

<211> 204 

<212> PRT 

<213> Homo sapien

<400> 209

Met Glu Arg Leu Thr Leu Pro Leu Gly Gly Ala Ala Ala Val Asp Glu
1 5 10 15

Tyr Leu Glu Tyr Arg Arg Tyr Lys Gln His Lys Thr Asp Leu Glu Ala
20 25 30

Ile Pro Gln Gln Cys Pro Ile Asp Leu Pro Cys Gln Val Thr Gly Cys
35 40 45

Gln Cys Arg Ala Tyr Leu Tyr Val Pro Leu Asn Gly Ser Gln Pro Ile
50 55 60

Arg Cys Arg Cys Lys His Phe Ala Asp Gln His Ser Ala Ala Pro Gly
65 70 75 80

Phe Thr Cys Asn Thr Cys Ser Lys Cys Ser Gly Phe His Ser Cys Phe
85 90 95

Thr Cys Ala Cys Gly Gln Pro Ala Tyr Ala His Asp Thr Val Val Glu
100 105 110

Thr Lys Gln Glu Arg Leu Ala Gln Glu Lys Pro Val Gly Gln Asp Ile
115 120 125

Pro Tyr Ala Ala Met Gly Gly Leu Thr Gly Phe Ser Ser Leu Ala Glu
130 135 140

Gly Tyr Met Arg Leu Asp Asp Ser Gly Ile Val Gly Thr Ser Ser Gln
145 150 155 160

Val Ser Ser Leu Arg Arg Pro Glu Glu Asp Asp Met Ala Phe Phe Glu
165 170 175

Arg Arg Tyr Gln Glu Arg Met Lys Met Glu Lys Ala Ala Lys Trp Lys
180 185 190



Gly Lys Ala Pro Leu Pro Ser Ala Thr Lys Pro Ser 
195 200 



<210> 210 

<211> 80 

<212> PRT 

<213> Homo sapien 



<400> 210

Glu Val Gln Glu Ala Ile Phe Phe Arg Val Cys Gly Ala Arg Ser Val
1 5 10 15

Val Leu Leu Leu Val Ala Val Arg Leu His Thr Leu Leu Ser Cys Pro
20 25 30

Leu Glu Gln Pro Ala Gly Thr Glu Trp Ile Leu Glu Glu Gly Val Thr
35 40 45

Thr Gly Pro Pro Arg Lys Pro Arg Ala Asp Ile Tyr Asn Leu Arg Ser
50 55 60

Pro Asp Glu Phe Ile Val Gly Gln Asn Gln Ala Leu Ile Glu Pro Gly
65 70 75 80



<210> 211 

<211> 84 

<212> PRT 

<213> Homo sapien 

<400> 211 

Glu Gln Gln Pro Ser Pro Ile Asp Ser Thr Glu Thr Thr Arg Asn Gln
1 5 10 15

Gln Val Arg Pro Thr Thr Ser Arg Asn Lys Arg Arg Ala Ala Ser Gln
20 25 30

His Ile Ser Lys Ala Thr Arg Pro Thr Ala Lys Arg Gln Ala Ala Asp
35 40 45

Ser Asp Ile Thr Thr Ser Gly Pro Thr Ala Thr Thr Thr Asp Asp Lys
50 55 60

Asn Asp Val Cys Glu Asp Thr Pro His Arg Arg Thr Thr Gly Trp His
65 70 75 80

Gln Arg Asp Leu



<210> 212 
<211> 56 

<212> PRT

<213> Homo sapien 

<400> 212 

Pro Leu Trp Arg Arg Leu Leu Leu Gly Ser Arg Leu Leu Leu Pro Cys 
1 5 10 15

Asn Arg Asn Trp Arg Trp Asn Met Arg Gly Ala Leu Trp Lys Glu Lys
20 25 30

Asp Arg Pro Cys Ala Phe Met Lys Val Lys Ile Trp Leu Asn Gln Phe
35 40 45

His Lys Val Thr Val Tyr Ile Ala
50 55



<210> 213 

<211> 451 

<212> PRT 

<213> Homo sapien 

<400> 213 

Met Phe Leu Leu Leu Leu His Leu Gln Ile Lys Trp Arg Ala Thr Ile
1 5 10 15

Asn Leu Leu Ser Val Thr Glu Asp Gly Leu His Phe Val Glu Tyr Tyr
20 25 30

Leu Asn Arg Ile Ile His Leu Asp Val Asp Ser Glu Ala Lys Lys Leu
35 40 45

Leu Gly Leu Gly Gln Lys His Leu Val Met Gly Asp Ile Pro Ala Ala
50 55 60

Val Asn Ala Phe Gln Glu Ala Ala Ser Leu Leu Gly Lys Lys Tyr Gly
65 70 75 80

Glu Thr Ala Asn Glu Cys Gly Glu Ala Phe Phe Phe Tyr Gly Lys Ser
85 90 95

Leu Leu Glu Leu Ala Arg Met Glu Asn Gly Val Leu Gly Asn Ala Leu
100 105 110

Glu Gly Val His Val Glu Glu Glu Glu Gly Glu Lys Thr Glu Asp Glu
115 120 125

Ser Leu Val Glu Asn Asn Asp Asn Ile Asp Glu Thr Glu Gly Ser Glu
130 135 140

Glu Asp Asp Lys Glu Asn Asp Lys Thr Glu Glu Met Pro Asn Asp Ser
145 150 155 160

Val Leu Glu Asn Lys Ser Leu Gln Glu Asn Glu Glu Glu Glu Ile Gly
165 170 175

Asn Leu Glu Leu Ala Trp Asp Met Leu Asp Leu Ala Lys Ile Ile Phe
180 185 190

Lys Arg Gln Glu Thr Lys Glu Ala Gln Leu Tyr Ala Ala Gln Ala His
195 200 205

Leu Lys Leu Gly Glu Val Ser Val Glu Ser Glu Asn Tyr Val Gln Ala
210 215 220

Val Glu Glu Phe Gln Ser Cys Leu Asn Leu Gln Glu Gln Tyr Leu Glu
225 230 235 240

Ala His Asp Arg Leu Leu Ala Glu Thr His Tyr Gln Leu Gly Leu Ala
245 250 255

Tyr Gly Tyr Asn Ser Gln Tyr Asp Glu Ala Val Ala Gln Phe Ser Lys
260 265 270

Ser Ile Glu Val Ile Glu Asn Arg Met Ala Val Leu Asn Glu Gln Val
275 280 285

Lys Glu Ala Glu Gly Ser Ser Ala Glu Tyr Lys Lys Glu Ile Glu Glu
290 295 300

Leu Lys Glu Leu Leu Pro Glu Ile Arg Glu Lys Ile Glu Asp Ala Lys
305 310 315 320

Glu Ser Gln Arg Ser Gly Asn Val Ala Glu Leu Ala Leu Lys Ala Thr
325 330 335

Leu Val Glu Ser Ser Thr Ser Gly Phe Thr Pro Gly Gly Gly Gly Ser
340 345 350

Ser Val Ser Met Ile Ala Ser Arg Lys Pro Thr Asp Gly Ala Ser Ser
355 360 365

Ser Asn Cys Val Thr Asp Ile Ser His Leu Val Arg Lys Lys Arg Lys
370 375 380

Pro Glu Glu Glu Ser Pro Arg Lys Asp Asp Ala Lys Lys Ala Lys Gln
385 390 395 400

Glu Pro Glu Val Asn Gly Gly Ser Gly Asp Ala Val Pro Ser Gly Asn
405 410 415

Glu Val Ser Glu Asn Met Glu Glu Glu Ala Glu Asn Gln Ala Glu Ser
420 425 430

Arg Ala Ala Val Glu Gly Thr Val Glu Ala Gly Ala Thr Val Glu Ser 
435 440 445 



Thr Ala Cys 
450 



<210> 214 

<211> 337 

<212> PRT 

<213> Homo sapien 

<400> 214 

Met Ala His Ala Pro Ala Arg Cys Pro Ser Ala Arg Gly Ser Gly Asp
1 5 10 15

Gly Glu Met Gly Lys Pro Arg Asn Val Ala Leu Ile Thr Gly Ile Thr
20 25 30

Gly Gln Asp Gly Ser Tyr Leu Ala Glu Phe Leu Leu Glu Lys Gly Tyr
35 40 45

Glu Val His Gly Ile Val Arg Arg Ser Ser Ser Phe Asn Thr Gly Arg
50 55 60

Ile Glu His Leu Tyr Lys Asn Pro Gln Ala His Ile Glu Gly Asn Met
65 70 75 80

Lys Leu His Tyr Gly Asp Leu Thr Asp Ser Thr Cys Leu Val Lys Ile
85 90 95

Ile Asn Glu Val Lys Pro Thr Glu Ile Tyr Asn Leu Gly Ala Gln Ser
100 105 110

His Val Lys Ile Ser Phe Asp Leu Ala Glu Tyr Thr Ala Asp Val Asp
115 120 125

Gly Val Gly Thr Leu Arg Leu Leu Asp Ala Val Lys Thr Cys Gly Leu
130 135 140

Ile Asn Ser Val Lys Phe Tyr Gln Ala Ser Thr Ser Glu Leu Tyr Gly
145 150 155 160

Lys Val Gln Glu Ile Pro Gln Lys Glu Thr Thr Pro Phe Tyr Pro Arg
165 170 175

Ser Pro Tyr Gly Ala Asn Phe Val Thr Arg Lys Ile Ser Arg Ser Val
180 185 190

Ala Lys Ile Tyr Leu Gly Gln Leu Glu Cys Phe Ser Leu Gly Asn Leu
195 200 205

Asp Ala Lys Arg Asp Trp Gly His Ala Lys Asp Tyr Val Glu Ala Met
210 215 220

Trp Leu Met Leu Gln Asn Asp Glu Pro Glu Asp Phe Val Ile Ala Thr
225 230 235 240

Gly Glu Val His Ser Val Arg Glu Phe Val Glu Lys Ser Phe Leu His
245 250 255

Ile Gly Lys Thr Ile Val Trp Glu Gly Lys Asn Glu Asn Glu Val Gly
260 265 270

Arg Cys Lys Glu Thr Gly Lys Val His Val Thr Val Asp Leu Lys Tyr
275 280 285

Tyr Arg Pro Thr Glu Val Asp Phe Leu Gln Gly Asp Cys Thr Lys Ala
290 295 300

Lys Gln Lys Leu Asn Trp Lys Pro Arg Val Ala Phe Asp Glu Leu Val
305 310 315 320

Arg Glu Met Val His Ala Asp Val Glu Leu Met Arg Thr Asn Pro Asn 
325 330 335 



Ala 



<210> 215 

<211> 332 

<212> PRT 

<213> Homo sapien 

<400> 215

Met Ala His Ala Pro Ala Arg Cys Pro Ser Ala Arg Gly Ser Gly Asp
1 5 10 15

Gly Glu Met Gly Lys Pro Arg Asn Val Ala Leu Ile Thr Gly Ile Thr
20 25 30

Gly Gln Asp Gly Ser Tyr Leu Ala Glu Phe Leu Leu Glu Lys Gly Tyr
35 40 45

Glu Val His Gly Ile Val Arg Arg Ser Ser Ser Phe Asn Thr Gly Arg
50 55 60

Ile Glu His Leu Tyr Lys Asn Pro Gln Ala His Ile Glu Gly Asn Met
65 70 75 80

Lys Leu His Tyr Gly Asp Leu Thr Asp Ser Thr Cys Leu Val Lys Ile
85 90 95

Ile Asn Glu Val Lys Pro Thr Glu Ile Tyr Asn Leu Gly Ala Gln Ser
100 105 110

His Val Lys Ile Ser Phe Asp Leu Ala Glu Tyr Thr Ala Asp Val Asp
115 120 125

Gly Val Gly Thr Leu Arg Leu Leu Asp Ala Val Lys Thr Cys Gly Leu
130 135 140

Ile Asn Ser Val Lys Phe Tyr Gln Ala Ser Thr Ser Glu Leu Tyr Gly
145 150 155 160

Lys Val Gln Glu Ile Pro Gln Lys Glu Thr Thr Pro Phe Tyr Pro Arg
165 170 175

Ser Pro Tyr Gly Ala Ala Lys Leu Tyr Ala Tyr Trp Ile Val Val Asn
180 185 190

Phe Arg Glu Ala Tyr Asn Leu Phe Ala Val Asn Gly Ile Leu Phe Asn
195 200 205

His Glu Ser Pro Arg Arg Gly Ala Asn Phe Val Thr Arg Lys Ile Ser
210 215 220

Arg Ser Val Ala Lys Ile Tyr Leu Gly Gln Leu Glu Cys Phe Ser Leu
225 230 235 240

Gly Asn Leu Asp Ala Lys Arg Asp Trp Gly His Ala Lys Asp Tyr Val
245 250 255

Glu Ala Met Trp Leu Met Leu Gln Asn Asp Glu Pro Glu Asp Phe Val
260 265 270






Ile Ala Thr Gly Glu Val His Ser Val Arg Glu Phe Val Glu Lys Ser
275 280 285

Phe Leu His Ile Gly Lys Thr Ile Val Trp Glu Gly Lys Asn Glu Asn
290 295 300

Glu Val Gly Arg Cys Lys Glu Thr Gly Lys Val His Val Thr Val Asp
305 310 315 320



Leu Lys Tyr Tyr Arg Pro Thr Glu Val Glu Thr Asn 
325 330 



<210> 216 

<211> 382 

<212> PRT 

<213> Homo sapien 

<400> 216 

Met Ala His Ala Pro Ala Arg Cys Pro Ser Ala Arg Gly Ser Gly Asp
1 5 10 15

Gly Glu Met Gly Lys Pro Arg Asn Val Ala Leu Ile Thr Gly Ile Thr
20 25 30

Gly Gln Asp Gly Ser Tyr Leu Ala Glu Phe Leu Leu Glu Lys Gly Tyr
35 40 45

Glu Val His Gly Ile Val Arg Arg Ser Ser Ser Phe Asn Thr Gly Arg
50 55 60

Ile Glu His Leu Tyr Lys Asn Pro Gln Ala His Ile Glu Gly Asn Met
65 70 75 80

Lys Leu His Tyr Gly Asp Leu Thr Asp Ser Thr Cys Leu Val Lys Ile
85 90 95

Ile Asn Glu Val Lys Pro Thr Glu Ile Tyr Asn Leu Gly Ala Gln Ser
100 105 110

His Val Lys Ile Ser Phe Asp Leu Ala Glu Tyr Thr Ala Asp Val Asp
115 120 125

Gly Val Gly Thr Leu Arg Leu Leu Asp Ala Val Lys Thr Cys Gly Leu
130 135 140

Ile Asn Ser Val Lys Phe Tyr Gln Ala Ser Thr Ser Glu Leu Tyr Gly
145 150 155 160

Lys Val Gln Glu Ile Pro Gln Lys Glu Thr Thr Pro Phe Tyr Pro Arg
165 170 175

Ser Pro Tyr Gly Ala Ala Lys Leu Tyr Ala Tyr Trp Ile Val Val Asn
180 185 190

Phe Arg Glu Ala Tyr Asn Leu Phe Ala Val Asn Gly Ile Leu Phe Asn
195 200 205

His Glu Ser Pro Arg Arg Gly Ala Asn Phe Val Thr Arg Lys Ile Ser
210 215 220

Arg Ser Val Ala Lys Ile Tyr Leu Gly Gln Leu Glu Cys Phe Ser Leu
225 230 235 240

Gly Asn Leu Asp Ala Lys Arg Asp Trp Gly His Ala Lys Asp Tyr Val
245 250 255

Glu Ala Met Trp Leu Met Leu Gln Asn Asp Glu Pro Glu Asp Phe Val
260 265 270

Ile Ala Thr Gly Glu Val His Ser Val Arg Glu Phe Val Glu Lys Ser
275 280 285

Phe Leu His Ile Gly Lys Thr Ile Val Trp Glu Gly Lys Asn Glu Asn
290 295 300

Glu Val Gly Arg Cys Lys Glu Thr Gly Lys Val His Val Thr Val Asp
305 310 315 320

Leu Lys Tyr Tyr Arg Pro Thr Glu Val Val Arg Thr Leu Trp Pro Pro
325 330 335

Ser Ala Trp Pro Arg Leu Ala Gly Trp Leu Gly Lys Cys Ala His Gly
340 345 350

Met Pro Gly Ala Ser Leu Trp Ser Cys Gln Phe Ser Ser Leu Ala Ser
355 360 365

Phe Ser Val His Phe Gln Asn Gln Asn Thr Val Asn Ser Ile
370 375 380



<210> 217 
<211> 258
<212> PRT 
<213> Homo sapien 

<400> 217 

Met Asn Ser Asn Val Glu Asn Leu Pro Pro His Ile Ile Arg Leu Val
1 5 10 15

Tyr Lys Glu Val Thr Thr Leu Thr Ala Asp Pro Pro Asp Gly Ile Lys
20 25 30

Val Phe Pro Asn Glu Glu Asp Leu Thr Asp Leu Gln Val Thr Ile Glu
35 40 45

Gly Pro Glu Gly Thr Pro Tyr Ala Gly Gly Leu Phe Arg Met Lys Leu
50 55 60

Leu Leu Gly Lys Asp Phe Pro Ala Ser Pro Pro Lys Gly Tyr Phe Leu
65 70 75 80

Thr Lys Ile Phe His Pro Asn Val Gly Ala Asn Gly Glu Ile Cys Val
85 90 95

Asn Val Leu Lys Arg Asp Trp Thr Ala Glu Leu Gly Ile Arg His Val
100 105 110

Leu Leu Thr Ile Lys Cys Leu Leu Ile His Pro Asn Pro Glu Ser Ala
115 120 125

Leu Asn Glu Glu Ala Gly Arg Leu Leu Leu Glu Asn Tyr Glu Glu Tyr
130 135 140

Ala Ala Arg Ala Arg Leu Leu Thr Glu Ile His Gly Gly Ala Gly Gly
145 150 155 160

Pro Ser Gly Arg Ala Glu Ala Gly Arg Ala Leu Ala Ser Gly Thr Glu
165 170 175

Ala Ser Ser Thr Asp Pro Gly Ala Pro Gly Arg Arg Ala Glu Val His
180 185 190

Trp Pro Glu His Val Gly Arg Glu Arg Arg Trp Gly Arg Lys Thr Thr
195 200 205

Asp Gly Ala Arg Val Lys Val Phe Leu Ser Arg Asp His Ser Ala Pro
210 215 220

Asn Phe Ser Asn Cys Gly Pro Ser Gly Arg Arg Val Asn Ala Gln Thr
225 230 235 240

Lys Lys Pro Ser Arg Lys Gly Val Leu Ser Ala Ala Phe Gln Ala Ser
245 250 255



Leu Leu 



<210> 218 

<211> 262

<212> PRT 

<213> Homo sapien 

<400> 218 

Met Arg Ala Val Ile Lys Arg Gln Trp Cys Ala Arg Gly Arg Leu Ser
1 5 10 15

Ala Ala Gly His Arg Gly Gly Gly Leu Val Tyr Ala Val Arg Gly Gly
20 25 30

Arg Arg Arg Gln Arg Gly Ala Glu Arg Gly Arg Arg Gly Leu Ser Arg
35 40 45

Ala Ala Ala Ala Ala Val Gly Pro Pro Ala Pro Ala Gly Gly Pro Lys
50 55 60

Asn Leu Pro Pro His Ile Ile Arg Leu Val Tyr Lys Glu Val Thr Thr
65 70 75 80

Leu Thr Ala Asp Pro Pro Asp Gly Ile Lys Val Phe Pro Asn Glu Glu
85 90 95

Asp Leu Thr Asp Leu Gln Val Thr Ile Glu Gly Pro Glu Gly Thr Pro
100 105 110

Tyr Ala Gly Gly Leu Phe Arg Met Lys Leu Leu Leu Gly Lys Asp Phe
115 120 125

Pro Ala Ser Pro Pro Lys Gly Tyr Phe Leu Thr Lys Ile Phe His Pro
130 135 140

Asn Val Gly Ala Asn Gly Glu Ile Cys Val Asn Val Leu Lys Arg Asp
145 150 155 160

Trp Thr Ala Glu Leu Gly Ile Arg His Val Leu Leu Thr Ile Lys Cys
165 170 175

Leu Leu Ile His Pro Asn Pro Glu Ser Ala Leu Asn Glu Glu Ala Gly
180 185 190

Arg Leu Leu Leu Glu Asn Tyr Glu Glu Tyr Ala Ala Arg Ala Arg Leu
195 200 205

Leu Thr Glu Ile His Gly Gly Ala Gly Gly Pro Ser Gly Arg Ala Glu
210 215 220

Ala Gly Arg Ala Leu Ala Ser Gly Thr Glu Ala Ser Ser Thr Asp Pro
225 230 235 240

Gly Ala Pro Gly Gly Pro Gly Gly Ala Glu Gly Ser His Gly Gln Glu
245 250 255



Ala Cys Trp Arg Ala Arg 
260 



<210> 219 

<211> 291 

<212> PRT 

<213> Homo sapien 

<400> 219 

Gly Ser Glu Leu Arg Gly Arg Gly Gly Arg Gly Leu Arg Ala Val Ile
1 5 10 15

Lys Arg Gln Trp Cys Ala Arg Gly Arg Leu Ser Ala Ala Gly His Arg
20 25 30

Gly Gly Gly Leu Val Tyr Ala Val Arg Gly Gly Arg Arg Arg Gln Arg
35 40 45

Gly Ala Glu Arg Gly Arg Arg Gly Leu Ser Arg Ala Ala Ala Ala Ala
50 55 60

Val Gly Pro Pro Ala Pro Ala Gly Gly Pro Lys Asn Leu Pro Pro His
65 70 75 80

Ile Ile Arg Leu Val Tyr Lys Glu Val Thr Thr Leu Thr Ala Asp Pro
85 90 95

Pro Asp Gly Ile Lys Val Phe Pro Asn Glu Glu Asp Leu Thr Asp Leu
100 105 110

Gln Val Thr Ile Glu Gly Pro Glu Gly Thr Pro Tyr Ala Gly Gly Leu
115 120 125

Phe Arg Met Lys Leu Leu Leu Gly Lys Asp Phe Pro Ala Ser Pro Pro
130 135 140

Lys Gly Tyr Phe Leu Thr Lys Ile Phe His Pro Asn Val Gly Ala Asn
145 150 155 160

Gly Glu Ile Cys Val Asn Val Leu Lys Arg Asp Trp Thr Ala Glu Leu
165 170 175

Gly Ile Arg His Val Leu Leu Thr Ile Lys Cys Leu Leu Ile His Pro
180 185 190

Asn Pro Glu Ser Ala Leu Asn Glu Glu Ala Gly Arg Leu Leu Leu Glu
195 200 205

Asn Tyr Glu Glu Tyr Ala Ala Arg Ala Arg Leu Leu Thr Glu Ile His
210 215 220



Gly Gly Ala Gly Gly Pro Ser Gly Arg Ala Glu Ala Gly Arg Ala Leu 
225 230 235 240 



Ala Ser Gly Thr Glu Ala Ser Ser Thr Asp Pro Gly Ala Pro Gly Gly 
245 250 255 



Pro Gly Gly Ala Glu Gly Pro Met Ala Lys Lys His Ala Gly Glu Arg 
260 265 270 



Asp Lys Lys Leu Ala Ala Lys Lys Lys Thr Asp Lys Lys Arg Ala Leu 
275 280 285 



Arg Arg Leu 
290 



<210> 220 

<211> 233 

<212> PRT 

<213> Homo sapien 

<400> 220 

Met Asn Ser Asn Val Glu Asn Leu Pro Pro His Ile Ile Arg Leu Val
1 5 10 15

Tyr Lys Glu Val Thr Thr Leu Thr Ala Asp Pro Pro Asp Gly Ile Lys
20 25 30

Val Phe Pro Asn Glu Glu Asp Leu Thr Asp Leu Gln Val Thr Ile Glu
35 40 45

Gly Pro Glu Gly Thr Pro Tyr Ala Gly Gly Leu Phe Arg Met Lys Leu
50 55 60

Leu Leu Gly Lys Asp Phe Pro Ala Ser Pro Pro Lys Gly Tyr Phe Leu
65 70 75 80

Thr Lys Ile Phe His Pro Asn Val Gly Ala Asn Gly Glu Ile Cys Val
85 90 95

Asn Val Leu Lys Arg Asp Trp Thr Ala Glu Leu Gly Ile Arg His Val
100 105 110

Leu Leu Leu Ser Trp Lys Asp Lys Gln Cys Gln Thr Gln Asp Thr Gln
115 120 125

Val Leu Leu Arg Ser Ala Gln Glu His Leu Thr Met Gln Arg Val Thr
130 135 140

Ile Lys Cys Leu Leu Ile His Pro Asn Pro Glu Ser Ala Leu Asn Glu
145 150 155 160

Glu Ala Gly Arg Leu Leu Leu Glu Asn Tyr Glu Glu Tyr Ala Ala Arg
165 170 175

Ala Arg Leu Leu Thr Glu Ile His Gly Gly Ala Gly Gly Pro Ser Gly
180 185 190

Arg Ala Glu Ala Gly Arg Ala Leu Ala Ser Gly Thr Glu Ala Ser Ser
195 200 205

Thr Asp Pro Gly Ala Pro Gly Gly Pro Gly Gly Ala Glu Gly Ser His
210 215 220

Gly Gln Glu Ala Cys Trp Arg Ala Arg
225 230



<210> 221 

<211> 390 

<212> PRT 

<213> Homo sapien 



<400> 221

Glu Pro Ser Arg Pro Pro Arg Ala Pro Ile Gly Arg Pro Ala Thr Gln
1 5 10 15

Pro Ser Pro Pro Leu Pro Ser Leu Leu Thr Ile Val Cys Gly Glu Gly
20 25 30

Ser Glu Arg Val Glu Asn Gln Gly Thr Cys Ala Leu Thr Ser Arg Leu
35 40 45

Arg Leu Gly Ala Glu Gly Pro Arg Arg Ala Trp Pro Ala Gly Gly Tyr
50 55 60

Lys Glu Ala Val Val Arg Ala Arg Pro Ala Gln Cys Cys Arg Ala Pro
65 70 75 80

Gly Arg Arg Val Gly Leu Arg Cys Ala Arg Arg Thr Ser Glu Ala Ala
85 90 95

Gly Ser Gly Ala Gly Pro Pro Gly Pro Leu Gln Gly Arg Ser Gly Ser
100 105 110

Ser Trp Ala Pro Arg Pro Gly Arg Arg Thr Glu Glu Arg Arg Lys Gly
115 120 125

Ala Gly Gly Thr Arg Pro Arg Pro Ala Ala Ala Met Asn Ser Asn Val
130 135 140

Glu Asn Leu Pro Pro His Ile Ile Arg Leu Val Tyr Lys Glu Val Thr
145 150 155 160

Thr Leu Thr Ala Asp Pro Pro Asp Gly Ile Lys Val Phe Pro Asn Glu
165 170 175

Glu Asp Leu Thr Asp Leu Gln Val Thr Ile Glu Gly Pro Glu Gly Thr
180 185 190

Pro Tyr Ala Gly Gly Leu Phe Arg Met Lys Leu Leu Leu Gly Lys Asp
195 200 205

Phe Pro Ala Ser Pro Pro Lys Gly Tyr Phe Leu Thr Lys Ile Phe His
210 215 220

Pro Asn Val Gly Ala Asn Gly Glu Ile Cys Val Asn Val Leu Lys Arg
225 230 235 240

Asp Trp Thr Ala Glu Leu Gly Ile Arg His Val Leu Leu Leu Ser Trp
245 250 255

Lys Asp Lys Gln Cys Gln Thr Gln Asp Thr Gln Val Leu Leu Arg Ser
260 265 270

Ala Gln Glu His Leu Thr Met Gln Arg Val Thr Ile Lys Cys Leu Leu
275 280 285

Ile His Pro Asn Pro Glu Ser Ala Leu Asn Glu Glu Ala Gly Arg Leu
290 295 300

Leu Leu Glu Asn Tyr Glu Glu Tyr Ala Ala Arg Ala Arg Leu Leu Thr
305 310 315 320

Glu Ile His Gly Gly Ala Gly Gly Pro Ser Gly Arg Ala Glu Ala Gly
325 330 335



Arg Ala Leu Ala Ser Gly Thr Glu Ala Ser Ser Thr Asp Pro Gly Ala 
340 345 350 



Pro Gly Gly Pro Gly Gly Ala Glu Gly Pro Met Ala Lys Lys His Ala 
355 360 365 



Gly Glu Arg Asp Lys Lys Leu Ala Ala Lys Lys Lys Thr Asp Lys Lys 
370 375 380 



Arg Ala Leu Arg Arg Leu
385 390




<210> 222

<211> 110

<212> PRT

<213> Homo sapien

<400> 222

Pro Gly Ala His Pro
1 5 10 15



Pro Lys Arg Ala Arg Ser Ala Ile Thr Lys Ile Ser Leu Val Leu Tyr
20 25 30

Phe Leu Thr Arg Arg Arg Arg Thr Gly Trp Arg Cys Ser Pro Ala Ala
35 40 45

Trp Arg Cys Gln Arg Ser Glu Gly Leu Gln Glu Gly Leu Lys Leu Pro
50 55 60

Ala Gln Asn Leu Arg Met Glu Pro Ala Leu His Tyr Leu Arg Ser Gln
65 70 75 80

Gly Leu Gly Arg Trp Arg Lys Val Ile Ser Pro Ser Leu Lys Ser Tyr
85 90 95

Phe Leu Asn Val Ala Pro His Gln Ala Leu Tyr Leu Thr Ser
100 105 110



<210> 223 

<211> 257 

<212> PRT 

<213> Homo sapien 

<400> 223 

Met Asp His Arg Ser Arg Leu Arg Gly Thr Gly Leu Asn Arg Ile Pro
1 5 10 15

Gly Thr Gln Ser Arg Ala Pro Arg Val Pro Leu Pro Phe His Val Gln
20 25 30

Gln Glu Ala Arg Glu Gly Glu Asp Trp Glu Arg Glu Pro Pro Arg Gln
35 40 45

Arg Pro Pro Ile Tyr Glu Pro Pro Glu Ser Glu Glu Leu Pro Asp Asn
50 55 60

Val Met Val Ser Lys Pro Ala Pro Tyr Trp Glu Gly Thr Ala Val Ile
65 70 75 80

Asp Gly Glu Phe Lys Glu Leu Lys Leu Thr Asp Tyr Arg Gly Lys Tyr
85 90 95

Leu Val Phe Phe Phe Tyr Pro Leu Asp Phe Thr Phe Val Cys Pro Thr
100 105 110

Glu Ile Ile Ala Phe Gly Asp Arg Leu Glu Glu Phe Arg Ser Ile Asn
115 120 125

Thr Glu Val Val Ala Cys Ser Val Asp Ser Gln Phe Thr His Leu Ala
130 135 140

Trp Ile Asn Thr Pro Arg Arg Gln Gly Gly Leu Gly Pro Ile Arg Ile
145 150 155 160

Pro Leu Leu Ser Asp Leu Thr His Gln Ile Ser Lys Asp Tyr Gly Val
165 170 175

Tyr Leu Glu Asp Ser Gly His Thr Leu Arg Gly Leu Phe Ile Ile Asp
180 185 190

Asp Lys Gly Ile Leu Arg Gln Ile Thr Leu Asn Asp Leu Pro Val Gly
195 200 205

Arg Ser Val Asp Glu Thr Leu Arg Leu Val Gln Ala Phe Gln Tyr Thr
210 215 220

Asp Lys His Gly Glu Val Cys Pro Ala Gly Trp Lys Pro Gly Ser Glu
225 230 235 240

Thr Ile Ile Pro Asp Pro Ala Gly Lys Leu Lys Tyr Phe Asp Lys Leu
245 250 255



Asn 



<210> 224 

<211> 105 

<212> PRT 

<213> Homo sapien 

<400> 224 

Met Gln Lys Lys Lys Asn Ser Asn Ser Asn Ser Gly Thr Ser Ser Phe
1 5 10 15

Gly Lys Arg Arg Asn Lys Thr His Thr Leu Cys Arg Arg Cys Gly Ser
20 25 30

Lys Ala Tyr His Leu Gln Lys Ser Thr Cys Gly Lys Cys Gly Tyr Pro
35 40 45

Ala Lys Arg Lys Arg Lys Tyr Asn Trp Ser Ala Lys Ala Lys Arg Arg
50 55 60

Asn Thr Thr Gly Thr Gly Arg Met Arg His Leu Lys Ile Val Tyr Arg
65 70 75 80

Arg Phe Arg His Gly Phe Arg Glu Gly Thr Thr Pro Lys Pro Lys Arg
85 90 95

Ala Ala Val Ala Ala Ser Ser Ser Ser
100 105

<210> 225

<211> 111 

<212> PRT 

<213> Homo sapien 

<400> 225 

Ile Phe Met Val Gly Val Asp Ala Lys Lys Lys Glu Phe Glu Phe Glu
1 5 10 15



Phe Gly Thr Ser Ser Phe Gly Lys Arg Arg Asn Lys Thr His Thr Leu 
20 25 30 



Cys Arg Arg Cys Gly Ser Lys Ala Tyr His Leu Gln Lys Ser Thr Cys
35 40 45 



Gly Lys Cys Gly Tyr Pro Ala Lys Arg Lys Arg Lys Tyr Asn Trp Ser 
50 55 60 



Ala Lys Ala Lys Arg Arg Asn Thr Thr Gly Thr Gly Arg Met Arg His 
65 70 75 80 



Leu Lys Ile Val Tyr Arg Arg Phe Arg His Gly Phe Arg Glu Gly Thr
85 90 95 



Thr Pro Lys Pro Lys Arg Ala Ala Val Ala Ala Ser Ser Ser Ser 
100 105 110



<210> 226 

<211> 104 

<212> PRT 

<213> Homo sapien 

<220> 

<221> MISC_FEATURE

<222> (3)..(5)

<223> X=any amino acid 



<400> 226 

Met Ser Xaa Xaa Xaa Arg Ile Arg Pro Arg Gly Thr Ser Ser Phe Gly
1 5 10 15



Lys Arg Arg Asn Lys Thr His Thr Leu Cys Arg Arg Cys Gly Ser Lys 
20 25 30 



Ala Tyr His Leu Gln Lys Ser Thr Cys Gly Lys Cys Gly Tyr Pro Ala
35 40 45 
Lys Arg Lys Arg Lys Tyr Asn Trp Ser Ala Lys Ala Lys Arg Arg Asn
50 55 60 

Thr Thr Gly Thr Gly Arg Met Arg His Leu Lys Ile Val Tyr Arg Arg
65 70 75 80 

Phe Arg His Gly Phe Arg Glu Gly Thr Thr Pro Lys Pro Lys Arg Ala 
85 90 95 



Ala Val Ala Ala Ser Ser Ser Ser 
100 



<210> 227 

<211> 129 

<212> PRT 

<213> Homo sapien 

<220> 

<221> MISC_FEATURE

<222> (12)..(12)

<223> X=any amino acid 



<220> 

<221> MISC_FEATURE

<222> (25)..(25)

<223> X=any amino acid 



<220> 

<221> MISC_FEATURE

<222> (62)..(62)

<223> X=any amino acid 



<220> 

<221> MISC_FEATURE

<222> (64)..(64)

<223> X=any amino acid 



<400> 227 

Gln Ser His Lys Thr Leu Val Leu Gln Thr Thr Xaa Arg Ser Leu Leu
1 5 10 15

Ala His Thr Thr Cys Arg Phe Trp Xaa Phe Pro Asn Leu Leu Gly Ile
20 25 30 



Lys Val Asn Asn Ser Ile Thr Arg Gly Ser Gly Gln Pro Ser Phe Val
35 40 45 



Arg Gly Cys Ile Val Gly Lys Pro Thr Ser Val Cys Gln Xaa Leu Xaa
50 55 60

Glu Phe Gly Arg Gly Glu Arg His Arg Leu Glu Ser Val Ala Ile Arg
65 70 75 80 



Arg Thr Arg Cys Ala Ala Ala Val Ala Leu Arg Pro Thr Thr Phe Arg 
85 90 95 



Ser Arg Pro Val Ala Asn Val Ala Thr Leu Pro Ser Ala Arg Glu Ser 
100 105 110 



Ile Thr Gly Val Pro Arg Leu Lys Asp Glu Ile Pro Pro Glu Leu Val
115 120 125 



Glu 



<210> 228 

<211> 96 

<212> PRT 

<213> Homo sapien 

<400> 228 

Ala Cys Arg Ala Ala Gln Cys Asp Gly Ser Trp Ser Arg Pro Arg Ser
1 5 10 15



Leu Cys Arg Arg Cys Gly Ser Lys Ala Tyr His Leu Gln Lys Ser Thr
20 25 30



Cys Gly Lys Cys Gly Tyr Pro Ala Lys Arg Lys Arg Lys Tyr Asn Trp 
35 40 45 



Ser Ala Lys Ala Lys Arg Arg Asn Thr Thr Gly Thr Gly Arg Met Arg 
50 55 60 



His Leu Lys Ile Val Tyr Arg Arg Phe Arg His Gly Phe Arg Glu Gly
65 70 75 80 



Thr Thr Pro Lys Pro Lys Arg Ala Ala Val Ala Ala Ser Ser Ser Ser 
85 90 95 



<210> 229

<211> 55 

<212> PRT 

<213> Homo sapien 



<400> 229 
Met His Ala Glu Arg Arg Ser Val Met Asp Arg Gly Arg Gly Arg Gly
1 5 10 15



Arg Pro Thr Thr Phe Arg Ser Arg Pro Val Ala Asn Val Ala Thr Leu 
20 25 30 



Pro Ser Ala Arg Glu Ser Ile Thr Gly Val Pro Arg Leu Lys Asp Glu
35 40 45 



Ile Pro Pro Glu Leu Val Glu
50 55 



<210> 230 

<211> 72 

<212> PRT 

<213> Homo sapien 

<400> 230 

Ala Tyr His Leu Gln Lys Ser Thr Cys Gly Lys Cys Gly Tyr Pro Ala
1 5 10 15



Lys Arg Lys Arg Lys Tyr Asn Trp Ser Ala Lys Ala Lys Arg Arg Asn 
20 25 30 



Thr Thr Gly Thr Gly Arg Met Arg His Leu Lys Ile Val Tyr Arg Arg
35 40 45 



Phe Arg His Gly Phe Arg Glu Gly Thr Thr Pro Lys Pro Lys Arg Ala 
50 55 60 



Ala Val Ala Ala Ser Ser Ser Ser 
65 70 



<210> 231 

<211> 185 

<212> PRT 

<213> Homo sapien

<400> 231 

Met Leu Glu Arg Arg Ser Val Asp Gly Cys Ala Arg Ala Gly Gly Arg 
1 5 10 15 



Ala Gly Gly Ala Ile Met Gly Val Asp Ile Arg His Asn Lys Asp Arg
20 25 30 



Lys Val Arg Arg Lys Glu Pro Lys Ser Gln Asp Ile Tyr Leu Arg Leu
35 40 45 
Leu Val Lys Leu Tyr Arg Phe Leu Ala Arg Arg Thr Asn Ser Thr Phe
50 55 60 



Asn Gln Val Val Leu Lys Arg Leu Phe Met Ser Arg Thr Asn Arg Pro
65 70 75 80 



Pro Leu Ser Leu Ser Arg Met Ile Arg Lys Met Lys Leu Pro Gly Arg
85 90 95 



Glu Asn Lys Thr Ala Val Val Val Gly Thr Ile Thr Asp Asp Val Arg
100 105 110



Val Gln Glu Val Pro Lys Leu Lys Val Cys Ala Leu Arg Val Thr Ser
115 120 125 

Arg Ala Arg Ser Arg Ile Leu Arg Ala Gly Gly Lys Ile Leu Thr Phe
130 135 140 

Asp Gln Leu Ala Leu Asp Ser Pro Lys Gly Cys Gly Thr Val Leu Leu
145 150 155 160 

Ser Gly Pro Arg Lys Gly Arg Glu Val Tyr Arg His Phe Gly Lys Ala 
165 170 175 



Pro Gly Thr Pro His Ser His Thr Lys 
180 185 



<210> 232 

<211> 214 

<212> PRT 

<213> Homo sapien 

<400> 232 

Gly Leu Trp His Cys Pro Ala Leu Arg Ser Ser Gln Gly Pro Arg Gly
1 5 10 15

Val Pro Ala Phe Arg Gln Gly Pro Arg Asn Pro Ala Gln Pro His Gln
20 25 30



Val Ser Ile Arg Pro Pro Ala Leu Pro Ser Pro Gln Thr Gln Pro Ala
35 40 45 



Gly Pro Gly Leu Ala Thr Leu Gly Leu Leu Leu Leu Ser Leu Val Pro 
50 55 60 



Ala Ser Pro Arg Pro Ser Gly Thr Leu Ser Cys Leu Ile Leu Pro Ala
65 70 75 80



Phe Pro Phe Asn Thr Ala Trp Ser Cys Val Phe Gln Gly Leu Ser Arg
85 90 95 



His Leu Leu Gly Ser Met Gln Phe Thr Gly Leu Cys Gln Pro Arg Leu
100 105 110 



Gly Pro Ser Arg Trp Trp Gly Arg Cys Phe His Ser Pro Ser Trp Leu 
115 120 125



Leu Gly Phe Pro Leu Cys Gln Ala Phe Pro Ala Ala Leu Thr Leu Leu
130 135 140 



Gly Leu Asn Val Thr Gly Leu Trp Cys Ser Cys Ala Thr Pro Gln Trp
145 150 155 160 



Pro Pro Leu Arg Gly Pro Pro Ser His Ser Leu Leu Ser Pro Gln Thr
165 170 175 



Leu Arg Pro Leu Gln Gly Pro Glu Val Arg Ala Cys Gln Arg Pro Thr
180 185 190 



Gly Gln Pro Arg Leu Gln Lys Leu Thr Leu Asp Pro Thr Leu Leu Leu
195 200 205 



Lys Arg Phe Leu Leu Thr 
210 



<210> 233 

<211> 131 

<212> PRT 

<213> Homo sapien 

<400> 233 

Met Leu Glu Arg Arg Ser Val Asp Gly Cys Ala Arg Ala Gly Gly Arg 
1 5 10 15



Ala Gly Gly Ala Ile Met Gly Val Asp Ile Arg His Asn Lys Asp Arg
20 25 30 



Lys Val Arg Arg Lys Glu Pro Lys Ser Gln Asp Ile Tyr Leu Arg Leu
35 40 45



Leu Val Lys Leu Tyr Arg Phe Leu Ala Arg Arg Thr Asn Ser Thr Phe 
50 55 60 
Asn Gln Val Val Leu Lys Arg Leu Phe Met Ser Arg Thr Asn Arg Pro
65 70 75 80 

Pro Leu Ser Leu Ser Arg Met Ile Arg Lys Met Lys Leu Pro Gly Arg
85 90 95 

Glu Asn Lys Thr Ala Val Val Val Gly Thr Ile Thr Asp Asp Val Arg
100 105 110

Val Gln Glu Val Pro Lys Leu Ile Gly Arg Asp His Ala Lys Pro Asp
115 120 125 



Ser Ser Thr 
130 



<210> 234 

<211> 132 

<212> PRT 

<213> Homo sapien 

<400> 234 

Asp Ala Cys Ser Ser Gly Ala Gly Asp Gly Cys Ala Arg Ala Gly Gly 
1 5 10 15

Arg Ala Gly Gly Ala Ile Met Gly Val Asp Ile Arg His Asn Lys Asp
20 25 30



Arg Lys Val Arg Arg Lys Glu Pro Lys Ser Gln Asp Ile Tyr Leu Arg
35 40 45 

Leu Leu Val Lys Leu Tyr Arg Phe Leu Ala Arg Arg Thr Asn Ser Thr 
50 55 60 

Phe Asn Gln Val Val Leu Lys Arg Leu Phe Met Ser Arg Thr Asn Arg
65 70 75 80 

Pro Pro Leu Ser Leu Ser Arg Met Ile Arg Lys Met Lys Leu Pro Gly
85 90 95 

Arg Glu Asn Lys Thr Ala Val Val Val Gly Thr Ile Thr Asp Asp Val
100 105 110

Arg Val Gln Glu Val Pro Lys Leu Ile Gly Arg Asp His Ala Lys Pro
115 120 125 



Asp Ser Ser Thr 
130



<210> 235 

<211> 195 

<212> PRT 

<213> Homo sapien

<400> 235 

Met Asp Trp Ser Arg Arg Gly Gly Arg Ala Gly Gly Ala Ile Met Gly
1 5 10 15



Val Asp Ile Arg His Asn Lys Asp Arg Lys Val Arg Arg Lys Glu Pro
20 25 30 



Lys Ser Gln Asp Ile Tyr Leu Arg Leu Leu Val Lys Leu Tyr Arg Phe
35 40 45 



Leu Ala Arg Arg Thr Asn Ser Thr Phe Asn Gln Val Val Leu Lys Arg
50 55 60 



Leu Phe Met Ser Arg Thr Asn Arg Pro Pro Leu Ser Leu Ser Arg Met 
65 70 75 80 

Val Ser Gly Trp Ser Arg Glu His Gly Arg Pro Gly Ser Arg Trp Val 
85 90 95 



Leu Ser Val Trp Lys Gly Gly Arg Thr Trp Ser Ser Gly Ser Asn Gln
100 105 110 



Gly Ile Lys Gly Leu Ser Gln Pro Val Ala Ser Val Glu Leu Gly Leu
115 120 125 



Leu Val Gly Thr Glu Cys Pro Trp Ala Val Gly Lys Ser Pro Gly Pro 
130 135 140



Pro Leu Leu Leu Phe Val Arg Trp Arg Cys Pro Gly Gly Phe Arg Arg 

145 150 155 160 

Leu Pro Gln Val Ile Thr Glu Phe Tyr Val Lys Gly Ser Ala Glu Gly

165 170 175 



Gly Pro Ile Glu Gln Ser Ala Phe Phe Phe Leu Ser Gly Ala Phe Pro
180 185 190



Ser Trp Thr 
195 
<210> 236
<211> 115 
<212> PRT 
<213> Homo sapien 

<400> 236 

Ser Ala Trp Thr Leu Thr Ser Asn Gly Arg Ser Phe Pro Gly Pro Phe 
1 5 10 15



Pro Lys Ser Trp Ala Cys Phe Leu Leu Pro Leu Ala Ile Leu Cys Pro
20 25 30 



Cys Gly Cys Ser Pro Thr Leu Arg Ala Val Pro Asp Leu Ser Cys Tyr 
35 40 45 



Phe Pro Lys Pro Glu Thr Ala Pro Leu Gln Ile Leu Ala Ala Pro Phe
50 55 60 



Pro Cys Val Gln Tyr Arg Val Ile Thr Cys Pro Ser Leu Val Pro Leu
65 70 75 80 



Ile Leu Pro Cys Asp Tyr Ser Val Ile Pro Leu Pro Val Pro Glu Pro
85 90 95 



Pro Gly Leu Phe Leu Gly Ser Pro Glu Cys Ser Pro Arg Thr Gln Ser
100 105 110



Ala Val Pro 
115 



<210> 237 

<211> 156 

<212> PRT 

<213> Homo sapien 

<400> 237 

Gln Ser Leu Gly Arg Gly Leu Ala Thr Thr Arg Gly Arg Gly Ser Asp
1 5 10 15 



Gly Asn Gly Pro Thr Gly Asn Gly Asp His Pro Asn Phe Ser Leu Ser 
20 25 30 



Glu Gly Arg Ala Phe Gly Ser Leu Ala Ala Gln Pro Ile Thr Ser Cys
35 40 45



Leu Ser Val Pro Ala Pro Pro Phe Ser Leu Gly Lys Leu Gln Asp Gly
50 55 60 
Leu Leu His Ile Thr Thr Cys Ser Phe Val Ala Pro Trp Asn Ser Leu
65 70 75 80 



Ser Leu Ala Gln Arg Arg Gly Phe Thr Lys Thr Tyr Thr Val Gly Cys
85 90 95 



Glu Glu Cys Thr Val Phe Pro Cys Leu Ser Ile Pro Cys Lys Leu Gln
100 105 110 



Ser Gly Thr His Cys Leu Trp Thr Asp Gln Leu Leu Gln Gly Ser Glu
115 120 125 



Lys Gly Phe Gln Ser Arg His Leu Ala Cys Leu Pro Arg Glu Pro Gly
130 135 140 



Leu Cys Thr Trp Gln Ser Leu Arg Ser Gln Ile Ala
145 150 155 



<210> 238 

<211> 226 

<212> PRT 

<213> Homo sapien 

<400> 238 

Met Ala Ala Ala Ala Ala Ala Ala Gly Ala Ala Gly Ser Ala Ala Pro 
1 5 10 15



Ala Ala Ala Ala Gly Ala Pro Gly Ser Gly Gly Ala Pro Ser Gly Ser 
20 25 30 



Gln Gly Val Leu Ile Gly Asp Arg Leu Tyr Ser Gly Val Leu Ile Thr
35 40 45 



Leu Glu Asn Cys Leu Leu Pro Asp Asp Lys Leu Arg Phe Thr Pro Ser 
50 55 60 



Met Ser Ser Gly Leu Asp Thr Asp Thr Glu Thr Asp Leu Arg Val Val 
65 70 75 80 



Gly Cys Glu Leu Ile Gln Ala Ala Gly Ile Leu Leu Arg Leu Pro Gln
85 90 95 



Val Ala Met Ala Thr Gly Gln Val Leu Phe Gln Arg Phe Phe Tyr Thr
100 105 110
Lys Ser Phe Val Lys His Ser Met Glu His Val Ser Met Ala Cys Val
115 120 125 



His Leu Ala Ser Lys Ile Glu Glu Ala Pro Arg Arg Ile Arg Asp Val
130 135 140 



Ile Asn Val Phe His Arg Leu Arg Gln Leu Arg Asp Lys Lys Lys Pro
145 150 155 160 



Val Pro Leu Leu Leu Asp Gln Asp Tyr Val Asn Leu Lys Asn Gln Ile
165 170 175 



Ile Lys Ala Glu Arg Arg Val Leu Lys Glu Leu Gly Phe Cys Val His
180 185 190 



Val Lys His Pro His Lys Ile Ile Val Met Tyr Leu Gln Val Leu Glu
195 200 205 



Cys Glu Arg Asn Gln His Leu Val Gln Thr Ser Trp Val Ala Ser Glu
210 215 220 



Gly Lys 
225 



<210> 239 

<211> 253 

<212> PRT 

<213> Homo sapien 

<400> 239 

Asp Ser Gln Asp Cys Leu Ala Leu Ser Pro Ser Asn Arg Leu Leu Arg
1 5 10 15



Gly Val Val Arg Leu Ser Arg Phe Ser Leu Asp Asn Ala Gly Gly Arg 
20 25 30 



Pro Gly Phe Pro Gly Gly Ala Leu Gln Phe Phe Leu Cys Leu Ala Ser
35 40 45 



Arg Asn Tyr Met Asn Asp Ser Leu Arg Thr Asp Val Phe Val Arg Phe 
50 55 60 



Gln Pro Glu Ser Ile Ala Cys Ala Cys Ile Tyr Leu Ala Ala Arg Thr
65 70 75 80 



Leu Glu Ile Pro Leu Pro Asn Arg Pro His Trp Phe Leu Leu Phe Gly
85 90 95 



Ala Thr Glu Glu Glu Ile Gln Glu Ile Cys Leu Lys Ile Leu Gln Leu
100 105 110 



Tyr Ala Arg Lys Lys Val Asp Leu Thr His Leu Glu Gly Glu Val Glu 
115 120 125 



Lys Arg Lys His Ala Ile Glu Glu Ala Lys Ala Gln Ala Arg Gly Leu
130 135 140 



Leu Pro Gly Gly Thr Gln Val Leu Asp Gly Thr Ser Gly Phe Ser Pro
145 150 155 160 



Ala Pro Lys Leu Val Glu Ser Pro Lys Glu Gly Lys Gly Ser Lys Pro 
165 170 175 



Ser Pro Leu Ser Val Lys Asn Thr Lys Arg Arg Leu Glu Gly Ala Lys
180 185 190 



Lys Ala Lys Ala Asp Ser Pro Val Asn Gly Leu Pro Lys Gly Arg Glu 
195 200 205 



Ser Arg Ser Arg Ser Arg Ser Arg Glu Gln Ser Tyr Ser Arg Ser Pro
210 215 220 



Ser Arg Ser Ala Ser Pro Lys Arg Arg Lys Ser Asp Ser Gly Ser Thr 
225 230 235 240 



Ser Gly Gly Ser Lys Ser Gln Arg Ser Leu Gln Arg Leu
245 250 



<210> 240 

<211> 346 

<212> PRT 

<213> Homo sapien 

<400> 240 

Asp Ser Gln Asp Cys Leu Ala Leu Ser Pro Ser Asn Arg Leu Leu Arg
1 5 10 15



Gly Val Val Arg Leu Ser Arg Phe Ser Leu Asp Asn Ala Gly Gly Arg 
20 25 30 



Pro Gly Phe Pro Gly Gly Ala Leu Gln Phe Phe Leu Cys Leu Ala Ser
35 40 45
Arg Asn Tyr Met Asn Asp Ser Leu Arg Thr Asp Val Phe Val Arg Phe
50 55 60 



Gln Pro Glu Ser Ile Ala Cys Ala Cys Ile Tyr Leu Ala Ala Arg Thr
65 70 75 80 



Leu Glu Ile Pro Leu Pro Asn Arg Pro His Trp Phe Leu Leu Phe Gly
85 90 95 



Ala Thr Glu Glu Glu Ile Gln Glu Ile Cys Leu Lys Ile Leu Gln Leu
100 105 110 



Tyr Ala Arg Lys Lys Val Asp Leu Thr His Leu Glu Gly Glu Val Glu 
115 120 125



Lys Arg Lys His Ala Ile Glu Glu Ala Lys Ala Gln Ala Arg Gly Leu
130 135 140 



Leu Pro Gly Gly Thr Gln Val Leu Asp Gly Thr Ser Gly Phe Ser Pro
145 150 155 160 



Ala Pro Lys Leu Val Glu Ser Pro Lys Glu Gly Lys Gly Ser Lys Pro 
165 170 175 



Ser Pro Leu Ser Val Lys Asn Thr Lys Arg Arg Leu Glu Gly Ala Lys 
180 185 190 



Lys Ala Lys Ala Asp Ser Pro Val Asn Gly Leu Pro Lys Gly Arg Glu 
195 200 205 



Ser Arg Ser Arg Ser Arg Ser Arg Glu Gln Ser Tyr Ser Arg Ser Pro
210 215 220



Ser Arg Ser Ala Ser Pro Lys Arg Arg Lys Ser Asp Ser Gly Ser Thr 
225 230 235 240 



Ser Gly Gly Ser Lys Ser Gln Ser Arg Ser Arg Ser Arg Ser Asp Ser
245 250 255 



Pro Pro Arg Gln Ala Pro Arg Ser Ala Pro Tyr Lys Gly Ser Glu Ile
260 265 270 



Arg Gly Ser Arg Lys Ser Lys Asp Cys Lys Tyr Pro Gln Lys Pro His
275 280 285 



Lys Ser Arg Ser Arg Ser Ser Ser Arg Ser Arg Ser Arg Ser Arg Glu 
290 295 300



Arg Ala Asp Asn Pro Gly Lys Tyr Lys Lys Lys Ser His Tyr Tyr Arg 
305 310 315 320 



Asp Gln Arg Arg Glu Arg Ser Arg Ser Tyr Glu Arg Thr Gly Arg Arg
325 330 335 



Tyr Glu Arg Asp His Pro Val Ala Ala Leu 
340 345 



<210> 241 

<211> 91 

<212> PRT 

<213> Homo sapien 

<400> 241 

Pro Thr Thr Thr Lys Phe Ala Ala Ala Ser Thr Phe Leu Asn Trp Cys 
1 5 10 15



Cys Leu Gly Phe Ile Ala Phe Ala Tyr Ser Val Lys Ser Arg Asp Arg
20 25 30 



Lys Met Val Gly Asp Val Thr Gly Ala Gln Ala Tyr Ala Ser Thr Ala
35 40 45 



Lys Cys Leu Asn Ile Trp Ala Leu Ile Leu Gly Ile Phe Met Thr Ile
50 55 60 



Gly Phe Ile Leu Leu Leu Val Phe Gly Ser Val Thr Val Tyr His Ile
65 70 75 80 



Met Leu Gln Ile Ile Gln Glu Lys Arg Gly Tyr
85 90 



<210> 242 

<211> 92 

<212> PRT 

<213> Homo sapien 

<400> 242 

Gly Gln Glu Asp Gly Trp Arg Arg Asp Arg Gly Pro Gly Leu Cys Leu
1 5 10 15



His Arg Gln Val Pro Glu His Leu Gly Pro Asp Phe Gly His Leu His
20 25 30
Asp Arg Ile His Pro Val Thr Gly Ile Arg Leu Cys Asp Ser Leu Pro
35 40 45 



Tyr Tyr Val Thr Asp Asn Thr Gly Lys Thr Gly Leu Leu Val Ala Ala 
50 55 60 



His Ser Leu Gln Pro Leu His Ser Thr Val Gln Cys Trp Pro Cys Thr
65 70 75 80 



Leu Gly Leu Leu Pro Leu Pro Pro Trp Ser Cys Pro 
85 90 



<210> 243 

<211> 137 

<212> PRT 

<213> Homo sapien 

<400> 243 

Met Leu Glu Arg Arg Ser Val Met Asp Arg Pro Pro Ala Glu Val Arg 
1 5 10 15 



Glu Thr Lys Ile Lys Gly Lys Ser Gly Arg Phe Phe Thr Val Lys Leu
20 25 30



Pro Val Ala Leu Asp Pro Gly Ala Lys Ile Ser Val Ile Val Glu Thr
35 40 45 



Val Tyr Thr His Val Leu His Pro Tyr Pro Thr Gln Ile Thr Gln Ser
50 55 60 



Glu Lys Gln Phe Val Val Phe Glu Gly Asn His Tyr Phe Tyr Ser Pro
65 70 75 80 



Tyr Pro Thr Lys Thr Gln Thr Met Arg Val Lys Leu Ala Ser Arg Asn
85 90 95 



Val Glu Ser Tyr Thr Lys Leu Gly Asn Pro Thr Arg Ser Glu Asp Leu 
100 105 110 



Leu Asp Tyr Gly Pro Phe Arg Asp Val Pro Ala Tyr Ser Gln Asp Thr
115 120 125



Phe Lys Val Pro Arg Pro Arg Pro Arg 
130 135 



<210> 244 
<211> 148 
<212> PRT

<213> Homo sapien 

<220> 

<221> MISC_FEATURE

<222> (21)..(22)

<223> X=any amino acid 



<220> 

<221> MISC_FEATURE

<222> (24)..(24)

<223> X=any amino acid 



<400> 244 



Arg Leu Ile Tyr Arg Ala Ile Gly His Leu Ile Met Leu Glu Arg Arg
1 5 10 15 



Ser Val Met Asp Xaa Xaa Pro Xaa Glu Val Arg Glu Thr Lys Ile Lys
20 25 30 



Gly Lys Ser Gly Arg Phe Phe Thr Val Lys Leu Pro Val Ala Leu Asp 
35 40 45 



Pro Gly Ala Lys Ile Ser Val Ile Val Glu Thr Val Tyr Thr His Val
50 55 60 



Leu His Pro Tyr Pro Thr Gln Ile Thr Gln Ser Glu Lys Gln Phe Val
65 70 75 80 



Val Phe Glu Gly Asn His Tyr Phe Tyr Ser Pro Tyr Pro Thr Lys Thr 
85 90 95 



Gln Thr Met Arg Val Lys Leu Ala Ser Arg Asn Val Glu Ser Tyr Thr
100 105 110 



Lys Leu Gly Asn Pro Thr Arg Ser Glu Asp Leu Leu Asp Tyr Gly Pro 
115 120 125 



Phe Arg Asp Val Pro Ala Tyr Ser Gln Asp Thr Phe Lys Val Pro Arg
130 135 140 



Pro Arg Pro Arg 
145 



<210> 245 

<211> 479 

<212> PRT 

<213> Homo sapien 
<400> 245



Met Glu Ala Pro Ala Ala Gly Leu Phe Leu Leu Leu Leu Leu Gly Thr 
1 5 10 15

Trp Ala Pro Ala Pro Gly Ser Ala Ser Ser Glu Ala Pro Pro Leu Ile
20 25 30 



Asn Glu Asp Val Lys Arg Thr Val Asp Leu Ser Ser His Leu Ala Lys 
35 40 45 

Val Thr Ala Glu Val Val Leu Ala His Leu Gly Gly Gly Ser Thr Ser 
50 55 60 

Arg Ala Thr Ser Phe Leu Leu Ala Leu Glu Pro Glu Leu Glu Ala Arg 
65 70 75 80 

Leu Ala His Leu Gly Val Gln Val Lys Gly Glu Asp Glu Glu Glu Asn
85 90 95 

Asn Leu Glu Val Arg Glu Thr Lys Ile Lys Gly Lys Ser Gly Arg Phe
100 105 110 

Phe Thr Val Lys Leu Pro Val Ala Leu Asp Pro Gly Ala Lys Ile Ser
115 120 125 

Val Ile Val Glu Thr Val Tyr Thr His Val Leu His Pro Tyr Pro Thr
130 135 140 

Gln Ile Thr Gln Ser Glu Lys Gln Phe Val Val Phe Glu Gly Asn His
145 150 155 160 

Tyr Phe Tyr Ser Pro Tyr Pro Thr Lys Thr Gln Thr Met Arg Val Lys
165 170 175 



Leu Ala Ser Arg Asn Val Glu Ser Tyr Thr Lys Leu Gly Asn Pro Thr 
180 185 190 



Arg Ser Glu Asp Leu Leu Asp Tyr Gly Pro Phe Arg Asp Val Pro Ala 

195 200 205 

Tyr Ser Gln Asp Thr Phe Lys Val His Tyr Glu Asn Asn Ser Pro Phe
210 215 220 



Leu Thr Ile Thr Ser Met Thr Arg Val Ile Glu Val Ser His Trp Gly
225 230 235 240
Asn Ile Ala Val Glu Glu Asn Val Asp Leu Lys His Thr Gly Ala Val
245 250 255 



Leu Lys Gly Pro Phe Ser Arg Tyr Asp Tyr Gln Arg Gln Pro Asp Ser
260 265 270 



Gly Ile Ser Ser Ile Arg Ser Phe Lys Thr Ile Leu Pro Ala Ala Ala
275 280 285 



Gln Asp Val Tyr Tyr Arg Asp Glu Ile Gly Asn Val Ser Thr Ser His
290 295 300 



Leu Leu Ile Leu Asp Asp Ser Val Glu Met Glu Ile Arg Pro Arg Phe
305 310 315 320 



Pro Leu Phe Gly Gly Trp Lys Thr His Tyr Ile Val Gly Tyr Asn Leu
325 330 335 



Pro Ser Tyr Glu Tyr Leu Tyr Asn Leu Gly Asp Gln Tyr Ala Leu Lys
340 345 350 



Met Arg Phe Val Asp His Val Phe Asp Glu Gln Val Ile Asp Ser Leu
355 360 365 



Thr Val Lys Ile Ile Leu Pro Glu Gly Ala Lys Asn Ile Glu Ile Asp
370 375 380 



Ser Pro Tyr Glu Ile Ser Arg Ala Pro Asp Glu Leu His Tyr Thr Tyr
385 390 395 400 



Leu Asp Thr Phe Gly Arg Pro Val Ile Val Ala Tyr Lys Lys Asn Leu
405 410 415 



Val Glu Gln His Ile Gln Asp Ile Val Leu Asp Ala Gln Val Lys Glu
420 425 430 



Leu Val Leu Lys Ser Ala Val Glu Ala Glu Arg Leu Val Ala Gly Lys 
435 440 445 



Leu Lys Lys Asp Thr Tyr Ile Glu Asn Glu Lys Leu Ile Ser Gly Lys
450 455 460 



Arg Gln Glu Leu Val Thr Lys Ile Asp His Ile Leu Asp Ala Leu
465 470 475 
<210> 246
<211> 361 
<212> PRT 
<213> Homo sapien 

<400> 246 

Met Glu Ala Pro Ala Ala Gly Leu Phe Leu Leu Leu Leu Leu Gly Thr 
1 5 10 15



Trp Ala Pro Ala Pro Gly Ser Ala Ser Ser Glu Ala Pro Pro Leu Ile
20 25 30



Asn Glu Asp Val Lys Arg Thr Val Asp Leu Ser Ser His Leu Ala Lys 
35 40 45 



Val Thr Ala Glu Val Val Leu Ala His Leu Gly Gly Gly Ser Thr Ser 
50 55 60 



Arg Ala Thr Ser Phe Leu Leu Ala Leu Glu Pro Glu Leu Glu Ala Arg 
65 70 75 80 



Leu Ala His Leu Gly Val Gln Val Lys Gly Glu Asp Glu Glu Glu Asn
85 90 95 



Asn Leu Glu Val Arg Glu Thr Lys Ile Lys Gly Lys Ser Gly Arg Phe
100 105 110 



Phe Thr Val Lys Leu Pro Val Ala Leu Asp Pro Gly Ala Lys Ile Ser
115 120 125 



Val Ile Val Glu Thr Val Tyr Thr His Val Leu His Pro Tyr Pro Thr
130 135 140 



Gln Ile Thr Gln Ser Glu Lys Gln Phe Val Val Phe Glu Gly Asn His
145 150 155 160 



Tyr Phe Tyr Ser Pro Tyr Pro Thr Lys Thr Gln Thr Met Arg Val Lys
165 170 175 



Leu Ala Ser Arg Asn Val Glu Ser Tyr Thr Lys Leu Gly Asn Pro Thr 
180 185 190 



Arg Ser Glu Asp Leu Leu Asp Tyr Gly Pro Phe Arg Asp Val Pro Ala 
195 200 205 



Tyr Ser Gln Asp Thr Phe Lys Val His Tyr Glu Asn Asn Ser Pro Phe
210 215 220



Leu Thr Ile Thr Ser Met Thr Arg Val Ile Glu Val Ser His Trp Gly
225 230 235 240 



Asn Ile Ala Val Glu Glu Asn Val Asp Leu Lys His Thr Gly Ala Val
245 250 255 



Leu Lys Gly Pro Phe Ser Arg Tyr Asp Tyr Gln Arg Gln Pro Asp Ser
260 265 270 



Gly Ile Ser Ser Ile Arg Ser Phe Lys Thr Ile Leu Pro Ala Ala Ala
275 280 285 



Gln Asp Val Tyr Tyr Arg Asp Glu Ile Gly Asn Val Ser Thr Ser His
290 295 300 



Leu Leu Ile Leu Asp Asp Ser Val Glu Met Glu Ile Arg Pro Arg Phe
305 310 315 320 



Pro Leu Phe Gly Gly Trp Lys Thr His Tyr Ile Val Gly Tyr Asn Leu
325 330 335 



Pro Ser Tyr Glu Tyr Leu Tyr Asn Leu Gly Gln Ser Ser Ile Val Arg
340 345 350 



Glu Lys Leu Thr Phe Ser Leu Ile Ser
355 360 



<210> 247 

<211> 420 

<212> PRT 

<213> Homo sapien 

<400> 247 

Met Glu Ala Pro Ala Ala Gly Leu Phe Leu Leu Leu Leu Leu Gly Thr 
1 5 10 15 



Trp Ala Pro Ala Pro Gly Ser Ala Ser Ser Glu Ala Pro Pro Leu Ile
20 25 30 



Asn Glu Asp Val Lys Arg Thr Val Asp Leu Ser Ser His Leu Ala Lys 
35 40 45 



Val Thr Ala Glu Val Val Leu Ala His Leu Gly Gly Gly Ser Thr Ser 
50 55 60 
Arg Ala Thr Ser Phe Leu Leu Ala Leu Glu Pro Glu Leu Glu Ala Arg
65 70 75 80 



Leu Ala His Leu Gly Val Gln Val Lys Gly Glu Asp Glu Glu Glu Asn
85 90 95 



Asn Leu Glu Val Arg Glu Thr Lys Ile Lys Gly Lys Ser Gly Arg Phe
100 105 110 



Phe Thr Val Lys Leu Pro Val Ala Leu Asp Pro Gly Ala Lys Ile Ser
115 120 125 



Val Ile Val Glu Thr Val Tyr Thr His Val Leu His Pro Tyr Pro Thr
130 135 140 



Gln Ile Thr Gln Ser Glu Lys Gln Phe Val Val Phe Glu Gly Asn His
145 150 155 160 



Tyr Phe Tyr Ser Pro Tyr Pro Thr Lys Thr Gln Thr Met Arg Val Lys
165 170 175



Leu Ala Ser Arg Asn Val Glu Ser Tyr Thr Lys Leu Gly Asn Pro Thr 
180 185 190 



Arg Ser Glu Asp Leu Leu Asp Tyr Gly Pro Phe Arg Asp Val Pro Ala
195 200 205 



Tyr Ser Gln Asp Thr Phe Lys Val His Tyr Glu Asn Asn Ser Pro Phe
210 215 220 



Leu Thr Ile Thr Ser Met Thr Arg Val Ile Glu Val Ser His Trp Gly
225 230 235 240 



Asn Ile Ala Val Glu Glu Asn Val Asp Leu Lys His Thr Gly Ala Val
245 250 255 



Leu Lys Gly Pro Phe Ser Arg Tyr Asp Tyr Gln Arg Gln Pro Asp Ser
260 265 270 



Gly Ile Ser Ser Ile Arg Ser Phe Lys Thr Ile Leu Pro Ala Ala Ala
275 280 285 



Gln Asp Val Tyr Tyr Arg Asp Glu Ile Gly Asn Val Ser Thr Ser His
290 295 300
Leu Leu Ile Leu Asp Asp Ser Val Glu Met Glu Ile Arg Pro Arg Phe
305 310 315 320 



Pro Leu Phe Gly Gly Trp Lys Thr His Tyr Ile Val Gly Tyr Asn Leu
325 330 335 



Pro Ser Tyr Glu Tyr Leu Tyr Asn Leu Gly Asp Gln Tyr Ala Leu Lys
340 345 350 



Met Arg Phe Val Asp His Val Phe Asp Glu Gln Val Ile Asp Ser Leu
355 360 365 



Thr Val Lys Ile Ile Leu Pro Glu Gly Ala Lys Thr Thr Ser Trp Met
370 375 380 

Pro Cys Ser Pro Cys Pro His Pro Pro Gly Gly Pro Gly Cys Leu His 
385 390 395 400 



Phe Ala Val Ala Gly Arg Leu Gly Gly Ser Gly Arg Leu Cys Met Glu 
405 410 415 



Ala Ser Glu Ser 
420 



<210> 248 

<211> 128 

<212> PRT 

<213> Homo sapien 

<400> 248 

Gly Cys Ala Arg Glu Pro Glu Ser Arg Leu Pro Lys Leu Gly Ser Trp 
1 5 10 15



Glu Asn Leu Gly Pro Gly Leu Thr Glu Lys Arg Arg Gly Lys Glu Ala 
20 25 30 



Gly Gln Glu Glu Gly Ala Trp Arg Thr Pro Ala Gly Gly Arg Gly Ala
35 40 45



Ala Gly Leu Ser Val Thr Pro Leu Ser Pro Pro Arg Pro Ala Pro Pro 
50 55 60 



Ala Gly Glu Gly Pro Arg Cys Pro Pro Gly Arg Pro Ala Pro Ala Arg
65 70 75 80 



Arg Arg Lys Gly Trp Arg Val Glu Arg Arg Gly Gly Arg Gly Ser Ala 
85 90 95 
Trp Asp Ala Pro Gly His Arg Ala Arg Ser Leu Arg Pro Gly Ala Gly
100 105 110 



Gln Val Arg Gly Gln Asp Val Gly Arg Thr Trp Cys Met Ala Ala Ser
115 120 125 



<210> 249 

<211> 315 

<212> PRT 

<213> Homo sapien 

<400> 249 

Met Ser Ala Ala Gly Ala Gly Ala Gly Val Glu Ala Gly Phe Ser Ser 
1 5 10 15



Glu Glu Leu Leu Ser Leu Arg Phe Pro Leu His Arg Ala Cys Arg Asp 
20 25 30 



Gly Asp Leu Ala Thr Leu Cys Ser Leu Leu Gln Gln Thr Pro His Ala
35 40 45 



His Leu Ala Ser Glu Asp Ser Phe Tyr Gly Trp Thr Pro Val His Trp 
50 55 60 



Ala Ala His Phe Gly Lys Leu Glu Cys Leu Val Gln Leu Val Arg Ala
65 70 75 80 



Gly Ala Thr Leu Asn Val Ser Thr Thr Arg Tyr Ala Gln Thr Pro Ala
85 90 95 



His Ile Ala Ala Phe Gly Gly His Pro Gln Cys Leu Val Trp Leu Ile
100 105 110



Gln Ala Gly Ala Asn Ile Asn Lys Pro Asp Cys Glu Gly Glu Thr Pro
115 120 125 



Ile His Lys Ala Ala Arg Ser Gly Ser Leu Glu Cys Ile Ser Ala Leu
130 135 140 



Val Ala Asn Gly Ala His Val Asp Leu Arg Asn Ala Ser Gly Leu Thr 
145 150 155 160 



Ala Ala Asp Ile Ala Gln Thr Gln Gly Phe Gln Glu Cys Ala Gln Phe
165 170 175
Leu Leu Asn Leu Gln Asn Cys His Leu Asn His Phe Tyr Asn Asn Gly
180 185 190 



Ile Leu Asn Gly Gly His Gln Asn Val Phe Pro Asn His Ile Ser Val
195 200 205 



Gly Thr Asn Arg Lys Arg Cys Leu Glu Asp Ser Glu Asp Phe Gly Val 
210 215 220 



Lys Lys Ala Arg Thr Glu Ala Gln Ser Leu Asp Ser Ala Val Pro Leu
225 230 235 240 



Thr Asn Gly Asp Thr Glu Asp Asp Ala Asp Lys Met His Val Asp Arg 
245 250 255 



Glu Phe Ala Val Val Thr Gly Gly Ser Gly Gln Phe Pro Val Ser Cys
260 265 270 



Asn Asn Asn Pro Met Val Glu Asp Thr Lys Gln Gln Glu Ser Gly Ser
275 280 285 



Val Gly Pro Lys Glu Ile Glu Ile Tyr Thr Val Ser Ala Met Gln Thr
290 295 300 



Pro Cys Arg Cys Arg Asn Gln Tyr Glu Lys Gln
305 310 315 



<210> 250 

<211> 142 

<212> PRT 

<213> Homo sapien 

<400> 250 

Met Gly Asn Phe Lys Ser Ile Ser Thr Ser Thr Lys Met Val Asn Gly
1 5 10 15



Arg Lys Ile Thr Thr Lys Arg Ile Val Glu Asn Gly Gln Glu Arg Val
20 25 30 



Glu Val Glu Glu Asp Gly Gln Leu Lys Ser Leu Thr Ile Asn Gly Val
35 40 45 



Ala Asp Asp Asp Ala Leu Ala Glu Glu Arg Met Arg Arg Gly Gln Asn
50 55 60 



Ala Leu Pro Ala Gln Pro Ala Gly Leu Arg Pro Pro Lys Pro Pro Arg
65 70 75 80
Pro Ala Ser Leu Leu Arg His Ala Pro His Cys Leu Ser Glu Glu Glu
85 90 95 



Gly Glu Gln Asp Arg Pro Arg Ala Pro Gly Pro Trp Asp Pro Leu Ala
100 105 110

Ser Ala Ala Gly Leu Lys Glu Gly Gly Lys Arg Lys Lys Gln Lys Gln
115 120 125 

Arg Glu Glu Ser Lys Lys Lys Lys Ser Thr Lys Gly Asn His 
130 135 140 



<210> 251 

<211> 72 

<212> PRT 

<213> Homo sapien 

<400> 251 

Met Gly Leu Ser His Ala Gly Trp His Arg Ala Gly Lys His Glu Ala 
1 5 10 15

Ser Pro His Gln Gly Phe Ala Cys Arg Lys Ala Ala Leu Trp Pro Ala
20 25 30 



Gly Glu Ala Glu Glu Thr Pro Val Asp Thr Leu Pro Thr Gly Leu Lys 
35 40 45 



Glu Gly Gly Lys Arg Lys Lys Gln Lys Gln Arg Glu Glu Ser Lys Lys
50 55 60 



Lys Lys Ser Thr Lys Gly Asn His 
65 70



<210> 252 

<211> 122 

<212> PRT 

<213> Homo sapien 

<400> 252 

Thr Gly Leu Glu Ala Arg Gly Ala Pro Pro Asp Ala Gly Ala Pro Pro 
1 5 10 15

Cys Ser Ala Cys Gly Arg Ala His Ala Leu Gly Ser Ser Val Gly Gln
20 25 30 



Asp Cys Leu Glu Ala Thr Leu Ala Arg Gln Asp Tyr Ala Ile Thr Asp
35 40 45



Gln Ser Glu Gln Gly Gln Glu Thr Gly Leu Thr Ala Arg Val Ala Gly
50 55 60 



Thr Asp Val Trp Asp Leu Ala Ala Thr Leu Cys Phe Ser Pro Ala Leu 
65 70 75 80 



Asn Leu Leu His Phe Pro Leu Val Leu Pro Asp Pro Leu His Ser Phe 
85 90 95 



Arg Leu Leu Asn His Ser Ala Cys Cys Trp Asn Ile Ser Gly Phe Arg
100 105 110 



Ser Thr Gly Gly Arg Arg Trp Leu Thr Glu 
115 120



<210> 253 

<211> 42 

<212> PRT 

<213> Homo sapien 

<400> 253 

Met Ala Lys Lys Ala Gly Leu Cys Leu Gly Gly Ser Arg Gln Gly Gly
1 5 10 15



Cys Gln Ser Gly Met Val Thr Gly Asn Glu Pro Arg Asp Leu Ala Leu
20 25 30 



Ser His Pro Leu Ser Phe Val Gly Gly Leu 
35 40 



<210> 254 

<211> 260 

<212> PRT 

<213> Homo sapien 

<400> 254 

Val Phe Cys Ser Phe Phe Ala Glu Lys Glu Gln Gln Glu Ala Ile Glu
1 5 10 15



His Ile Asp Glu Val Gln Asn Glu Ile Asp Arg Leu Asn Glu Gln Ala
20 25 30 



Ser Glu Glu Ile Leu Lys Val Glu Gln Lys Tyr Asn Lys Leu Arg Gln
35 40 45
Pro Phe Phe Gln Lys Arg Ser Glu Leu Ile Ala Lys Ile Pro Asn Phe
50 55 60 



Trp Val Thr Thr Phe Val Asn His Pro Gln Val Ser Ala Leu Leu Gly
65 70 75 80 



Glu Glu Asp Glu Glu Ala Leu His Tyr Leu Thr Arg Val Glu Val Thr 
85 90 95 



Glu Phe Glu Asp Ile Lys Ser Gly Tyr Arg Ile Asp Phe Tyr Phe Asp
100 105 110



Glu Asn Pro Tyr Phe Glu Asn Lys Val Leu Ser Lys Glu Phe His Leu 
115 120 125 



Asn Glu Ser Gly Asp Pro Ser Ser Lys Ser Thr Glu Ile Lys Trp Lys
130 135 140 



Ser Gly Lys Asp Leu Thr Lys Arg Ser Ser Gln Thr Gln Asn Lys Ala
145 150 155 160



Ser Arg Lys Arg Gln His Glu Glu Pro Glu Ser Phe Phe Thr Trp Phe
165 170 175 



Thr Asp His Ser Asp Ala Gly Ala Asp Glu Leu Gly Glu Val Ile Lys
180 185 190 



Asp Asp Ile Trp Pro Asn Pro Leu Gln Tyr Tyr Leu Val Pro Asp Met
195 200 205



Asp Asp Glu Glu Gly Glu Gly Glu Glu Asp Asp Asp Asp Asp Glu Glu 
210 215 220 



Glu Glu Gly Leu Glu Asp Ile Asp Glu Glu Gly Asp Glu Asp Glu Gly
225 230 235 240 



Glu Glu Asp Glu Asp Asp Asp Glu Gly Glu Glu Gly Glu Glu Asp Glu 
245 250 255 



Gly Glu Asp Asp 
260 



<210> 255 

<211> 285 

<212> PRT 

<213> Homo sapien 
<400> 255

Ser Leu Gln Asp Lys Arg Ala Pro Ile Pro Glu His Thr Pro Phe Ser
1 5 10 15

Ser Ser Pro Phe Cys Ala Ser Leu Leu Ser Asp Leu Ile Val Ala Pro
20 25 30 

Lys Lys Glu Gln Gln Glu Ala Ile Glu His Ile Asp Glu Val Gln Asn
35 40 45 

Glu Ile Asp Arg Leu Asn Glu Gln Ala Ser Glu Glu Ile Leu Lys Val
50 55 60

Glu Gln Lys Tyr Asn Lys Leu Arg Gln Pro Phe Phe Gln Lys Arg Ser
65 70 75 80

Glu Leu Ile Ala Lys Ile Pro Asn Phe Trp Val Thr Thr Phe Val Asn
85 90 95 



His Pro Gln Val Ser Ala Leu Leu Gly Glu Glu Asp Glu Glu Ala Leu
100 105 110



His Tyr Leu Thr Arg Val Glu Val Thr Glu Phe Glu Asp Ile Lys Ser
115 120 125

Gly Tyr Arg Ile Asp Phe Tyr Phe Asp Glu Asn Pro Tyr Phe Glu Asn
130 135 140 

Lys Val Leu Ser Lys Glu Phe His Leu Asn Glu Ser Gly Asp Pro Ser 
145 150 155 160 

Ser Lys Ser Thr Glu Ile Lys Trp Lys Ser Gly Lys Asp Leu Thr Lys
165 170 175 

Arg Ser Ser Gln Thr Gln Asn Lys Ala Ser Arg Lys Arg Gln His Glu
180 185 190 

Glu Pro Glu Ser Phe Phe Thr Trp Phe Thr Asp His Ser Asp Ala Gly 
195 200 205 

Ala Asp Glu Leu Gly Glu Val Ile Lys Asp Asp Ile Trp Pro Asn Pro
210 215 220 

Leu Gln Tyr Tyr Leu Val Pro Asp Met Asp Asp Glu Glu Gly Glu Gly
225 230 235 240
Glu Glu Asp Asp Asp Asp Asp Glu Glu Glu Glu Gly Leu Glu Asp Ile
245 250 255 



Asp Glu Glu Gly Asp Glu Asp Glu Gly Glu Glu Asp Glu Asp Asp Asp 
260 265 270 



Glu Gly Glu Glu Gly Glu Glu Asp Glu Gly Glu Asp Asp 
275 280 285 



<210> 256 

<211> 600 

<212> PRT 

<213> Homo sapien 

<400> 256 

Met Ala Thr Pro Leu Pro Gly Arg Ala Gly Gly Pro Ala Thr Pro Leu 
1 5 10 15



Ser Pro Thr Arg Leu Ser Arg Leu Gln Glu Lys Glu Glu Leu Arg Glu
20 25 30 



Leu Asn Asp Arg Leu Ala His Tyr Ile Asp Arg Val Arg Ala Leu Glu
35 40 45 



Leu Glu Asn Asp Arg Leu Leu Leu Lys Ile Ser Glu Lys Glu Glu Val
50 55 60 



Thr Thr Arg Glu Val Ser Gly Ile Lys Ala Leu Tyr Glu Ser Glu Leu
65 70 75 80 



Ala Asp Ala Arg Arg Val Leu Asp Glu Thr Ala Arg Glu Arg Ala Arg 
85 90 95 



Leu Gln Ile Glu Ile Gly Lys Leu Arg Ala Glu Leu Asp Glu Val Asn
100 105 110 



Lys Ser Ala Lys Lys Arg Glu Gly Glu Leu Thr Val Ala Gln Gly Arg
115 120 125 



Val Lys Asp Leu Glu Ser Leu Phe His Arg Ser Glu Val Glu Leu Ala 
130 135 140



Ala Ala Leu Ser Asp Lys Arg Gly Leu Glu Ser Asp Val Ala Glu Leu 
145 150 155 160 



Arg Ala Gln Leu Ala Lys Ala Glu Asp Gly His Ala Val Ala Lys Lys
165 170 175



Gln Leu Glu Lys Glu Thr Leu Met Arg Val Asp Leu Glu Asn Arg Cys
180 185 190 



Gln Ser Leu Gln Glu Glu Leu Asp Phe Arg Lys Ser Val Phe Glu Glu
195 200 205 



Glu Val Arg Glu Thr Arg Arg Arg His Glu Arg Arg Leu Val Glu Val 
210 215 220 



Asp Ser Ser Arg Gln Gln Glu Tyr Asp Phe Lys Met Ala Gln Ala Leu
225 230 235 240 



Glu Glu Leu Arg Ser Gln His Asp Glu Gln Val Arg Leu Tyr Lys Leu
245 250 255 



Glu Leu Glu Gln Thr Tyr Gln Ala Lys Leu Asp Ser Ala Lys Leu Ser
260 265 270 



Ser Asp Gln Asn Asp Lys Ala Ala Ser Ala Ala Arg Glu Glu Leu Lys
275 280 285 



Glu Ala Arg Met Arg Leu Glu Ser Leu Ser Tyr Gln Leu Ser Gly Leu
290 295 300 



Gln Lys Gln Ala Ser Ala Ala Glu Asp Arg Ile Arg Glu Leu Glu Glu
305 310 315 320



Ala Met Ala Gly Glu Arg Asp Lys Phe Arg Lys Met Leu Asp Ala Lys 
325 330 335 



Glu Gln Glu Met Thr Glu Met Arg Asp Val Met Gln Gln Gln Leu Ala
340 345 350 



Glu Tyr Gln Glu Leu Leu Asp Val Lys Leu Ala Leu Asp Met Glu Ile
355 360 365 



Asn Ala Tyr Arg Lys Leu Leu Glu Gly Glu Glu Glu Arg Leu Lys Leu 
370 375 380 



Ser Pro Ser Pro Ser Ser Arg Val Thr Val Ser Arg Ala Thr Ser Ser
385 390 395 400 



Ser Ser Gly Ser Leu Ser Ala Thr Gly Arg Leu Gly Arg Ser Lys Arg 
405 410 415

Lys Arg Leu Glu Val Glu Glu Pro Leu Gly Ser Gly Pro Ser Val Leu
420 425 430 



Gly Thr Gly Thr Gly Gly Ser Gly Gly Phe His Leu Ala Gln Gln Ala
435 440 445 



Ser Ala Ser Gly Ser Val Ser Ile Glu Glu Ile Asp Leu Glu Gly Lys
450 455 460



Phe Val Gln Leu Lys Asn Asn Ser Asp Lys Asp Gln Ser Leu Gly Asn
465 470 475 480 



Trp Arg Ile Lys Arg Gln Val Leu Glu Gly Glu Glu Ile Ala Tyr Lys
485 490 495 



Phe Thr Pro Lys Tyr Ile Leu Arg Ala Gly Gln Met Val Thr Val Trp
500 505 510 



Ala Ala Gly Ala Gly Val Ala His Ser Pro Pro Ser Thr Leu Val Trp 
515 520 525 



Lys Gly Gln Ser Ser Trp Gly Thr Gly Glu Ser Phe Arg Thr Val Leu
530 535 540 



Val Asn Ala Asp Gly Glu Glu Val Ala Met Arg Thr Val Lys Lys Ser 
545 550 555 560 



Ser Val Met Arg Glu Asn Glu Asn Gly Glu Glu Glu Glu Glu Glu Ala 
565 570 575 



Glu Phe Gly Glu Glu Asp Leu Phe His Gln Gln Gly Asp Pro Arg Thr
580 585 590 



Thr Ser Arg Gly Cys Tyr Val Met 
595 600 



<210> 257 

<211> 620 

<212> PRT 

<213> Homo sapien 

<400> 257 



Met Ser Pro Pro Ser Pro Gly Arg Arg Arg Glu Gln Arg Arg Pro Arg
1 5 10 15

Ala Ala Ala Thr Met Ala Thr Pro Leu Pro Gly Arg Ala Gly Gly Pro
20 25 30 



Ala Thr Pro Leu Ser Pro Thr Arg Leu Ser Arg Leu Gln Glu Lys Glu
35 40 45 

Glu Leu Arg Glu Leu Asn Asp Arg Leu Ala His Tyr Ile Asp Arg Val
50 55 60

Arg Ala Leu Glu Leu Glu Asn Asp Arg Leu Leu Leu Lys Ile Ser Glu
65 70 75 80 



Lys Glu Glu Val Thr Thr Arg Glu Val Ser Gly Ile Lys Ala Leu Tyr
85 90 95 



Glu Ser Glu Leu Ala Asp Ala Arg Arg Val Leu Asp Glu Thr Ala Arg 
100 105 110 



Glu Arg Ala Arg Leu Gln Ile Glu Ile Gly Lys Leu Arg Ala Glu Leu
115 120 125 

Asp Glu Val Asn Lys Ser Ala Lys Lys Arg Glu Gly Glu Leu Thr Val 
130 135 140 



Ala Gln Gly Arg Val Lys Asp Leu Glu Ser Leu Phe His Arg Ser Glu
145 150 155 160 



Val Glu Leu Ala Ala Ala Leu Ser Asp Lys Arg Gly Leu Glu Ser Asp 
165 170 175 

Val Ala Glu Leu Arg Ala Gln Leu Ala Lys Ala Glu Asp Gly His Ala
180 185 190 



Val Ala Lys Lys Gln Leu Glu Lys Glu Thr Leu Met Arg Val Asp Leu
195 200 205 



Glu Asn Arg Cys Gln Ser Leu Gln Glu Glu Leu Asp Phe Arg Lys Ser
210 215 220 



Val Phe Glu Glu Glu Val Arg Glu Thr Arg Arg Arg His Glu Arg Arg 
225 230 235 240 



Leu Val Glu Val Asp Ser Ser Arg Gln Gln Glu Tyr Asp Phe Lys Met
245 250 255 



Ala Gln Ala Leu Glu Glu Leu Arg Ser Gln His Asp Glu Gln Val Arg
260 265 270



Leu Tyr Lys Leu Glu Leu Glu Gln Thr Tyr Gln Ala Lys Leu Asp Ser
275 280 285 



Ala Lys Leu Ser Ser Asp Gln Asn Asp Lys Ala Ala Ser Ala Ala Arg
290 295 300 



Glu Glu Leu Lys Glu Ala Arg Met Arg Leu Glu Ser Leu Ser Tyr Gln
305 310 315 320 



Leu Ser Gly Leu Gln Lys Gln Ala Ser Ala Ala Glu Asp Arg Ile Arg
325 330 335 



Glu Leu Glu Glu Ala Met Ala Gly Glu Arg Asp Lys Phe Arg Lys Met
340 345 350 



Leu Asp Ala Lys Glu Gln Glu Met Thr Glu Met Arg Asp Val Met Gln
355 360 365 



Gln Gln Leu Ala Glu Tyr Gln Glu Leu Leu Asp Val Lys Leu Ala Leu
370 375 380 



Asp Met Glu Ile Asn Ala Tyr Arg Lys Leu Leu Glu Gly Glu Glu Glu
385 390 395 400 



Arg Leu Lys Leu Ser Pro Ser Pro Ser Ser Arg Val Thr Val Ser Arg 
405 410 415 



Ala Thr Ser Ser Ser Ser Gly Ser Leu Ser Ala Thr Gly Arg Leu Gly 
420 425 430 



Arg Ser Lys Arg Lys Arg Leu Glu Val Glu Glu Pro Leu Gly Ser Gly 
435 440 445



Pro Ser Val Leu Gly Thr Gly Thr Gly Gly Ser Gly Gly Phe His Leu 
450 455 460 



Ala Gln Gln Ala Ser Ala Ser Gly Ser Val Ser Ile Glu Glu Ile Asp
465 470 475 480 



Leu Glu Gly Lys Phe Val Gln Leu Lys Asn Asn Ser Asp Lys Asp Gln
485 490 495 



Ser Leu Gly Asn Trp Arg Ile Lys Arg Gln Val Leu Glu Gly Glu Glu
500 505 510

Ile Ala Tyr Lys Phe Thr Pro Lys Tyr Ile Leu Arg Ala Gly Gln Met
515 520 525 



Val Thr Val Trp Ala Ala Gly Ala Gly Val Ala His Ser Pro Pro Ser 
530 535 540 



Thr Leu Val Trp Lys Gly Gln Ser Ser Trp Gly Thr Gly Glu Ser Phe
545 550 555 560 



Arg Thr Val Leu Val Asn Ala Asp Gly Glu Glu Val Ala Met Arg Thr 
565 570 575 



Val Lys Lys Ser Ser Val Met Arg Glu Asn Glu Asn Gly Glu Glu Glu 
580 585 590 



Glu Glu Glu Ala Glu Phe Gly Glu Glu Asp Leu Phe His Gln Gln Gly
595 600 605 



Asp Pro Arg Thr Thr Ser Arg Gly Cys Tyr Val Met 
610 615 620 



<210> 258 

<211> 237 

<212> PRT 

<213> Homo sapien 

<400> 258 

Met Ser Pro Pro Ser Pro Gly Arg Arg Arg Glu Gln Arg Arg Pro Arg
1 5 10 15



Ala Ala Ala Thr Met Ala Thr Pro Leu Pro Gly Arg Ala Gly Gly Pro 
20 25 30 



Ala Thr Pro Leu Ser Pro Thr Arg Leu Ser Arg Leu Gln Glu Lys Glu
35 40 45 



Glu Leu Arg Glu Leu Asn Asp Arg Leu Ala His Tyr Ile Asp Arg Val
50 55 60 



Arg Ala Leu Glu Leu Glu Asn Asp Arg Leu Leu Leu Lys Ile Ser Glu
65 70 75 80 



Lys Glu Glu Val Thr Thr Arg Glu Val Ser Gly Ile Lys Ala Leu Tyr
85 90 95

Glu Ser Glu Leu Ala Asp Ala Arg Arg Val Leu Asp Glu Thr Ala Arg
100 105 110 



Glu Arg Ala Arg Leu Gln Ile Glu Ile Gly Lys Leu Arg Ala Glu Leu
115 120 125 



Asp Glu Val Asn Lys Ser Ala Lys Lys Arg Glu Gly Glu Leu Thr Val 
130 135 140 



Ala Gln Gly Arg Val Lys Asp Leu Glu Ser Leu Phe His Arg Ser Glu
145 150 155 160



Val Glu Leu Ala Ala Ala Leu Ser Asp Lys Arg Gly Leu Glu Ser Asp 
165 170 175 



Val Ala Glu Leu Arg Ala Gln Leu Ala Lys Ala Glu Asp Gly His Ala
180 185 190 



Val Ala Lys Lys Gln Leu Glu Lys Gly Cys Pro Cys Ser Gln Lys Ser
195 200 205 



Arg Ser His Val Asp Arg Gly Gly Arg Ile Leu Pro Lys His Phe Leu
210 215 220 



Leu Glu Ala Thr Pro Leu Cys Ser Gln Ser Gly Gly Trp
225 230 235 



<210> 259 

<211> 620 

<212> PRT 

<213> Homo sapien 

<400> 259 

Met Ser Pro Pro Ser Pro Gly Arg Arg Arg Glu Gln Arg Arg Pro Arg
1 5 10 15



Ala Ala Ala Thr Met Ala Thr Pro Leu Pro Gly Arg Ala Gly Gly Pro 
20 25 30 



Ala Thr Pro Leu Ser Pro Thr Arg Leu Ser Arg Leu Gln Glu Lys Glu
35 40 45 



Glu Leu Arg Glu Leu Asn Asp Arg Leu Ala His Tyr Ile Asp Arg Val
50 55 60 



Arg Ala Leu Glu Leu Glu Asn Asp Arg Leu Leu Leu Lys Ile Ser Glu
65 70 75 80

Lys Glu Glu Val Thr Thr Arg Glu Val Ser Gly Ile Lys Ala Leu Tyr
85 90 95 



Glu Ser Glu Leu Ala Asp Ala Arg Arg Val Leu Asp Glu Thr Ala Arg 
100 105 110 



Glu Arg Ala Arg Leu Gln Ile Glu Ile Gly Lys Leu Arg Ala Glu Leu
115 120 125 



Asp Glu Val Asn Lys Ser Ala Lys Lys Arg Glu Gly Glu Leu Thr Val 
130 135 140 



Ala Gln Gly Arg Val Lys Asp Leu Glu Ser Leu Phe His Arg Ser Glu
145 150 155 160



Val Glu Leu Ala Ala Ala Leu Ser Asp Lys Arg Gly Leu Glu Ser Asp 
165 170 175 



Val Ala Glu Leu Arg Ala Gln Leu Ala Lys Ala Glu Asp Gly His Ala
180 185 190 



Val Ala Lys Lys Gln Leu Glu Lys Glu Thr Leu Met Arg Val Asp Leu
195 200 205 



Glu Asn Arg Cys Gln Ser Leu Gln Glu Glu Leu Asp Phe Arg Lys Ser
210 215 220 



Val Phe Glu Glu Glu Val Arg Glu Thr Arg Arg Arg His Glu Arg Arg 
225 230 235 240 



Leu Val Glu Val Asp Ser Ser Arg Gln Gln Glu Tyr Asp Phe Lys Met
245 250 255 



Ala Gln Ala Leu Glu Glu Leu Arg Ser Gln His Asp Glu Gln Val Arg
260 265 270 



Leu Tyr Lys Leu Glu Leu Glu Gln Thr Tyr Gln Ala Lys Leu Asp Ser
275 280 285 



Ala Lys Leu Ser Ser Asp Gln Asn Asp Lys Ala Ala Ser Ala Ala Arg
290 295 300



Glu Glu Leu Lys Glu Ala Arg Met Arg Leu Glu Ser Leu Ser Tyr Gln
305 310 315 320

Leu Ser Gly Leu Gln Lys Gln Ala Ser Ala Ala Glu Asp Arg Ile Arg
325 330 335 



Glu Leu Glu Glu Ala Met Ala Gly Glu Arg Asp Lys Phe Arg Lys Met 
340 345 350 



Leu Asp Ala Lys Glu Gln Glu Met Thr Glu Met Arg Asp Val Met Gln
355 360 365 



Gln Gln Leu Ala Glu Tyr Gln Glu Leu Leu Asp Val Lys Leu Ala Leu
370 375 380 



Asp Met Glu Ile Asn Ala Tyr Arg Lys Leu Leu Glu Gly Glu Glu Glu
385 390 395 400 



Arg Leu Lys Leu Ser Pro Ser Pro Ser Ser Arg Val Thr Val Ser Arg 
405 410 415 



Ala Thr Ser Ser Ser Ser Gly Ser Leu Ser Ala Thr Gly Arg Leu Gly 
420 425 430 



Arg Ser Lys Arg Lys Arg Leu Glu Val Glu Glu Pro Leu Gly Ser Gly 
435 440 445 



Pro Ser Val Leu Gly Thr Gly Thr Gly Gly Ser Gly Gly Phe His Leu 
450 455 460 



Ala Gln Gln Ala Ser Ala Ser Gly Ser Val Ser Ile Glu Glu Ile Asp
465 470 475 480 



Leu Glu Gly Lys Phe Val Gln Leu Lys Asn Asn Ser Asp Lys Asp Gln
485 490 495 



Ser Leu Gly Asn Trp Arg Ile Lys Arg Gln Val Leu Glu Gly Glu Glu
500 505 510 



Ile Ala Tyr Lys Phe Thr Pro Lys Tyr Ile Leu Arg Ala Gly Gln Met
515 520 525 



Val Thr Val Trp Ala Ala Gly Ala Gly Val Ala His Ser Pro Pro Ser 
530 535 540 



Thr Leu Val Trp Lys Gly Gln Ser Ser Trp Gly Thr Gly Glu Ser Phe
545 550 555 560

Arg Thr Val Leu Val Asn Ala Asp Gly Glu Glu Val Ala Met Arg Thr
565 570 575 

Val Lys Lys Ser Ser Val Met Arg Glu Asn Glu Asn Gly Glu Glu Glu 
580 585 590 

Glu Glu Glu Ala Glu Phe Gly Glu Glu Asp Leu Phe His Gln Gln Gly
595 600 605 

Asp Pro Arg Thr Thr Ser Arg Gly Cys Tyr Val Met 
610 615 620 



